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work necessary to convert one language into another. Each 
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egrating the immediately observable differences into a 
rmula of measurement. (This study of interlingual distance 
th a discussion of language universals and types of 
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INTRODUCri ON 



It has boon said that all languages ore dialects* Not all dialects, 
however, can afford to be languages, unless their speakers are independent 
enough to say they are. The mutual incomprehensibility of German dialects 
like Plattdeutsch and Bavarian has not been sufficient to raise them to the 
status of languages, no more than the inter- intelligibility of their 
northern cousins lias reduced the Scandinavian languages to the status of 
dialects. If mutual intelligibility is a criterion for distinguishing 
language from dialect, at what point of similarity does the one change into 
the other? We do not even have the measurements needed for finding the point. 
And if we did, wo could ask better questions, such as, how far removed is the 
one from the other* This is just one of the needs for developing measures of 
interlingual distance* There are several others in such fields as language 
learning, language teaching, language contact, bilingualism, creole studies, 
and in the geographical and genetic studies of languages * 

In the field of language learning, it has beeft. .commonly accepted that 
some % languages are more difficult than others, especially when seen from the 
viewpoint of the second language learner. 
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simply by looking at a text of 
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y two languages that 







according to the language chosen ; so will the number of differences in such 



it takes, certain persons to reach a certain level of proficiency as measured 
by certain tests* For example, according to a study made for the United States 



or more exactly, if it takes an educated adult native spcc$:qf? of English 720 
hours to achieve a certain level of proficiency in Malay, it will take him 1520 
hours to attain the same level of proficiency in Chinese. 



mine the extent to which elements of the mother tongue may profitably be trans- 
ferred to usable habits in the foreign language. Mint is the degree at which 
such interlingual identification becomes useful? 

In the field of bilingual education, ethnic minorities are often labeled 
by the language they speak - Spanish, German, French, and so on - and it is 
this language that is sometimes decreed as being their normal educational 
vehicle. In fact, however, the variety of the language actually understood 
by the minority may be an admixture of regional dialect and foreign words far 
removed from the international standard that educators have in mind. At what 
point are such dialects worth salvaging as media of instruction? In other 
words, if instruction is to be given in a language other than that of the 



things ns word order, agreement, and length of the equivalent elements in on: 



own language* 






In the teaching of foreign languages, it has always been difficult to deter- 
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country* is the local variety of that language nee 
standard to make its use worth while? Or is it so 
become more of a hindrance than a help? Again we 



2.' enough to an educated 
far removed that it will 
do not have a way of fintlin 



E 



out the degree of difference to enable us seriously to investigate the problem. 
As for the bi lingual parents/ to what extent is their speech influenced 
by one language more than the other? And in the case of actual language mixture 
and crcolization , to. what extent is cadi language related to the parent tongue? 
lVhat sort of mixture is it? And what arc the proportions of the mix? 



Translation studies, interpretation research and the field of comparative 
stylistics could eventually benefit from a quantification of their methods of 
confronting two different languages. 



In the study of language relationships, it is useful to discover the extent 
to which geographically adjacent or genetically related languages have come 
together or diverged; but we would need standard measures of the extent of such 
similarity or difference. 

It vjill be seen later that many of these questions are related - that, for 
example, interlingual intelligibility is a function of congruity, which in turn 
is a function of bilingual interference, and that both are related to the process 
of interlingual distance. Indeed, research into all these areas could -benefit 
from accurate measurement and comparability. It is for the purpose of advancing 
the design of such research that the elaboration of techniques of interlingual 
measurement was undertaken. 

The first problem is to find out the extent to which one language or 
dialect can differ from another. This will depend on our knowledge of how 
languages differ and of what they have in common, that is, on the universals of 
language. It also will depend on what we mean by interlingual distance and what 
we consider to be representative measurement. Our immediate concern will, there- 
fore, be with universals, types of distance and techniques of measurement. Let 
us first examine the question of universals. 
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LANGUAGE UNIVERSALS 



1.1 Man: Mind, Body and Environment 

1.2 Lon gurige: Code, Discourse and Communication 

1.3 Univcrsals of Human Speech 

1.3.1 Characteristics and Constraints 

1.3.2 Common Consonants 

1.4 Univcrsals in Interlingual Distance 
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1AYGUAGE IKT VERSALS 

A central problem in the study of language universal* is to find out how 
languages do not and cannot differ. To find out how languages do not differ 
would require a systematic examination of all the languages of the world. To 
find out j on the "other hand, how languages canno t differ would require an inquiry 
into the nature of man in relation to language and the nature of language in 
relation to man. 

Since we are not yet prepared to examine systematically all the languages 
of the world, we will have to forego our examination of what: all languages 
have in common and concentrate on the question of what all languages must 
have in common. We can attempt this by postulating the character! cts which 
all human beings must share and what all languages as languages must comprise. 

Wc can then study the relationship between these universal* and make use of 
the results to distinguish different types of language distance and possible 
ways of measuring them. Let us first consider the postulates on man and his 
language. 

1.1 Man : Min d ? body and En v ironment 

It would seem plausible to assume that all human languages have common 
human traits. If all men are essentially the same in body and mind, are there 
'certain physical and mental characteristics related to language which can be 
said to be universal? 

We can perhaps make the following general assumptions about man and his 
speech : 

1. He can hear the sounds he uses and make the sounds he hears. 

2. He can put sound groups together into texts. 

3. He can direct these texts to different hearers and modify them 

to suit different hearers. 

O 
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He can associate the sounds he hears and learns with perceptible 
differences in his environment . 

5, These environmental and sound features he groups into functional 
units . 

6, The grouping is done for him by the language he hears, and they 
become conventions to which he is bound, 

7, He also learns to associate analogous grouping with the non- 
environmental workings of his mind - with logical and conceptual 
processes. 

8, In this way, all man’s mental activities become associated with 
these conventional groupings of sounds and concepts - his pcrcopti on 
of the sound sequences and what they stand for, his conception of 
them into equivalent categories of thought, his cognition of the 
relationship between them, his evaluation of the speech effects 
upon persons he hears and addresses, his use of these relations 

and evaluations to find solutions to lii.s questions his memory of 
all tills, and his use of what is thus stored in anticipation of 
future thought and expression. 

If certain dements of man's mind and body are indeed the same whoever 
his is or wherever he maybe, then certain association of his mental activities 
in language with his physical actions in speech may well be universal. There 
are, for example, certain limits imposed by the make-up of man’s thought and 
speech; he must function in a time sequence and his speech must, operate 
within the limited capacities of the human anatomy - excluding as they do, 
for example, such anatomically impossible sounds and api co-pharyngals . 

If, in addition to this, all environments in which man is able to exist 
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have something in common, some of these' features may also be universal. Man, 
in order simply to exist must eat, drink, sleep, act and reproduce. And we 
can consequently suppose that all his languages rust have labels or concept 
categories which include these essential activities. We can therefore say 
that some characteristics of man's mind, body and environment are universal 
by their very nature. We can also assume some universal characteristics which 
all languages by their very nature must share . 

1.2 Languag e : Code, Discours e an d Comm uni c at :1 on 

All languages must include certain code, discourse and cOmmuni cation 

1 

components. The codes of all languages must be composed of symbols and of 
procedures (or rules) for putting them togethe r to generate messages (texts or 
discourse) usable for purposes of communication. All codes h&ve certain 
essential elements in common. Although their elements must all be finite, 
their generating capacity may be infinite. Their elements are 'not things in 
themselves; they are representatives (or symbols) of things . i These symbols 

i 

> 

may represent universal operations of the mind*, like identify (is) and negation 
(not), universal features of the human condition (experience and action), 

i 

universal features of message production (reference to something said), and 

i 

i 

universal features of all actions of communication (person jspeaking and person 

spoken to), hack code contains the symbols it needs to operate. All codes 

i 

need symbols for (1) logical and human operations, (2) discourse features and 

\ 

(3) communication variables. Each symbol may have one or many forms expressed 

.» 

i 

in the elements which all language have in common. Cacli type of symbol may 

i 

be related to the essentials of man's mind, body and environment . The possi- 

j 

bilities of these relationships may be seen by plotting oncj set against the 
other. (See Table 1) 
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AREAS WHERE UNI VERS ALS MAY BE SOUGHT 



Man 

Language 


. KIND 


BODY 


i ENVIRONMENT 


CODE . 


3 


2 


3 , 


DISCOURSE : 


; 4 


5 


6 


COMMON 1 I CAT I O’v 


7 


S 


1 ' 9 ; 



In each of these nine -areas,' we may ask wMch ; features fire universal* 

What are the universal characteristics of 1 * trains format ions;; (propositions ,etc. ) v 
2. articulation (phonemes^ word forms ,etc. ) , 3. categorization (size, mimber,ct:c r ) 
4* utterance (identification > substitution , reference , voice ,etc*) , 5. catenation 



( as s i mi 1 at i on , di s t r ib ut i on , 
etc .) 3 7. style (mood, forms 
(intonation, tempo, etc.), 9 
(spoken, written). 



etc.), 6 . collocation (compounding, word linking, 
of address, ctc.)> Sj- prosodic llexp res si on , | !: 

register (formality! slang, . etc.)’, me cti 



1,3 Uni vers a Is of Human Speech .. . .j, ' !'• . . /; , • f. i 

iVli at is universal about man's impressions find the 'way he labels them tp 
produce speech? Is there something common in the jphysi cal sources of these 
impressions, or the process of : applying labels to Jthem? Axe there common : 
components applicable to all human speecli? All hijnnan speech may have ;;;so 3 Ee thing 
in common in the processes which produce the characteristics and constraints 
of any language and the components common to all of them. 

• i. . '< ■' 

■ • j - 

1.3.1 Giaracteristic.s and Constraints. . I 



It would seem evident that there must be symbols in the minds of men to 
enable them to talk about 'what all men have in common - their bodies and their 
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needs ** eating, drinking, walking and sleeping. Although these symbols may 
well he universal, their characteristics are not necessarily so. 

Although words for parts of the body, for example, may well be universal, 
there is no universal agreement on what can be considered as a part of the 
body. Some of the Slavonic and Celtic languages , for example, label the foot 
and the leg as one part - Serbian nog a; Erse cos A :, 1 They .may likewise combine arm 
and hand - 'Russian ruk a . Some languages label the extremities with the same word 
Spanish dodo for both toe and finger. Like other , languages , however, they can 
specify by using phrases or compounds , In Spanish, they can say, log d odos de- 
le s pi es. In Korean, however, they say, palkarak (toe) and s onk arak (finger). And 



in this language, as in others, some parts of the body may be extended or be 
extensions' of a more general category; as in the above example, toes, fingers, 
spoons and other instruments can be labeled as a .certain kind of stick ( kayak ) : 
palkarak (toe), sonk arak (finger), snick arak (spoon) . c holic arak (chopstick). 

This may seem unusual only if it is someone clse’s custom; English, French and 
German and other European languages have similar extensions. Tn English, there 
are the hands of the clock, the face of the clock, the foot of the table, the 
eye of a needle, where French .uses needle ( aiguille) of a clock instead of hand. 
But like English, in other contexts French makes a distinction between needle and 
pin ( Qnin hle ) , whereas German has a more inclusive n ade 1 label for both, using 
compounds to distinguish between needle ( Naim ade 1) and pin ( Stechnade 3) , 

Many but not all of these metaphorical processes touch the grammar of 

i. 

languages and they may be formalized in any of tile grammatical features such as 
position, word-ending or formal marking, like affixation. The distinctions may 

; ■ . , j i 

be logically formalized but arbitrarily applied; for not all languages apply 

ji • j . 

logical categories in the same way. We may think, for example, that the dis- 
tinction between person and thing is universal; but we need only compare the 

11 ? 






( il, elle) or in its articles fie, la, xrn, me); English docs (he* she., it) , 



•but not always , not in its articles (the , a). The uni vers a. Is of logic are. not 



the uni vers a Is of grammar. 

)Ve must distinguish between . what a language has and how a language 



functions. Most language seem to have logical categories; but they are not always 
logical in their use. A concept categorised as a thing in one language* may be 

classed as an action in mother language* Time in one language may be treated 

as space in another ( A long road takes a long tim e to cover .) Because of this 
arbitrariness of languages in the use of universal logical categories, it would 
be hazardous to guess which grammatical features are common to all languages. 

Difference in categorization appears as a constraint on the conventions of 
the language. It is as if those who speak it have agreed to a package deal 

which they must take or leave. Such constraints arc typical of all human organiza- 

tion, jf a pharmaceuti cal chemist wanted to open shops in France, Germany, Canada 
and the United States, in France and Germany he- could limit himself to the sale of 
medicaments; in the United States and Canada he would also have to deal in 
cosmetics, magazines and a host of other products. Contrariwise, if his business 
man friend wanted to sell soap in these countries at the retail level, he would* 
have to open a perfume store ( perfume rj e ) in France, a non-prescription Pro gen ie 
in Germany and a pharmacy or drugstore in the United States and Canada. Or 
if a man were interested in telegrams, he would be dealing with the post office 
in England, Frail ce and Germany, with the railway companies in Canada, and with 
a telegraphy company in the United States. But if he were involved in inter- 
urban telephone communications, althou fo . he would also be dealing with the post- 
office in England, France and Germany, he would have to associate with utility 





10 



companies in the United States and parts of Canada. In a similar way, the 
package deal of language categories functions as a binding feature - not only 
semantic, but formal as well. It also extends down into the neurological median- 
isms of speedi , operating as associated nerve reflexes. Native speakers of 
English, for example, when they utter vowels involving action of the back of the 
tongue are subject, through mutually conditioned reflexes, to the constraint of 
lip rounding at the same time. The reflex prevents them from disassociating the 
one from the other, making it difficult to pronounce foreign vowel sounds whore 
the one is not bound to the other, as for example, in making the proper dis- 
tinction between such French words as rue and roue (see below). Bound reflexes 
are the manifestations, at all levels, of the 'way a particular language groups 
the features of sound and meaning into the elements of its code. 

Even though all languages may have labels for nouns, verbs and qualities, 
they do differ in the features of the environment which they can categorize under 
each of these labels, that is, in what is considered to be a thing (noun), mi 
action (verb), or a quality (adjective and adverb). This ,r as if 11 process touches 
all levels of language, while, from language to* language , it is differently 
applied. 

It may be that the more abstract or general we get when talking about language 
the more common diaracteri.stics we are likely to find. All languages have 
sentences and can express propositions. All languages label things. The way 
these labels are put together in propositions may have been determined at the dawn 
of human speed; by the fundamental didiotomies of man as he evolved into a tool 
user. It has been suggested that when man succeeded in establishing a dichotomy 
between the working or hammering hand and the passive or holding hand, he paved 
the way for other didiotomies in human thought and speech. Did this also enable 
him to make an analogous dichotomy in the types of impressions that readied his 

13 
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senses and the grouping of them into the stable or holding ones (the nairas * the 
subject), and the hammering or operational ones (the actions, the predicate), 
and to generalize this, in cognition, into all topics on the one hand and all 
comments about them on the other? It has been observed that the distinction 
between actor and action in child utterances occurs about the time the child 
lias learned to make a distinction between one hand and the other. There seems 
to be a parallel between the child's physical development and its linguistic 
development. Systematically pairing physical and language observations in the 
development of children of a!3 languages might therefore throw 1 some light on 
the existence of language universals, and on the common components of code, dis- 
course and communication. 

1.3.2 Commo n Componen ts 

Axe there components which all language codes have in common? Are there 
also common components in the discourse and communication aspects of language? 

To what extent are language codes alike? Are there latent systems and structures 
in the very makeup of the human brain which arc transformed by the growing child 
into the system of a particular language? If there is a universal grammar, 
does it have a biological counterpart? It seems that the nature of the mind's 
symbolic behavior may be universal; in language) it stores and operates a -systems 
of symbols so organized as to permit productivity and efficiency in communication. 

Although these symbols are related to the environment which the code of 
the language divides into concept categories, some of the symbols are so general 
as to relate to features common to all environments, some of them probably inate 
to the very workings of the human mind - cases like the impossibility of a 
thing being and not being at the same time and under the same circumstances, 
negation, distinctions between things and qualities, actions and states , place 
and time, actor and object, container and contained and the numerous possible 
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results, wo could p:- s i?.:- - : that they r > 1 s o r:.;uc?!i such 30 lati c-uj li ips rs ■'I.no did 
what to whom, hho did what elc.’ r IT thin is t ni? , certain ease concepts 

like the accusative, dative, ]ecati\c, causative and temporal, wo natter how 
expressed, nay rl-n he universal. 

V.'e can pc di ajv assume t li ;• 1 a]J Ian, neg s have r.vci: Meat i ra I roe:v for dis- 
tinguishing action:. from the causes of such actions end their effects. That is, 
they have verb -li he categories and noun-liko categories, ;uul also perhaps quality- 
like categories. I ut how far can we go in these speculations on the uni versa 3 s 
of language codes? 

Can we say as much about the common components of di scorn sc? There nay be 
universal s of discourse, such os the necessity of inferring to something said 
before, .and the need to substitute a shorter form for a clubbers ore repetition. 

Here again laj3guag.es differ in how they satisfy this universal weed. Trench 
will put previous lefc-rencc into the s 0.0 category’ ws its deictic ( fet her re...) 
in c as e s who re V. n g 1 i sh w i 1 1 con s i do r it as de f i n i t i on (1 hc_jan 1 \ . . ) . All 
languages haco wavs of putting together their concept and grernat i cal c at e go lies 
to yield messages - ways of connecting words in collocations (i. . gl rss _of_ vi \Kf 
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pui t i es . 





If the. uni Vs*. \ ■ ;»3 vomit i» of loj'ic affect the. discourse of languages, 

so tin the \\ i i vr ]*:* ii ] i' of coi.v.uni cal i r n . in o: del to tvihe language fnncVi c.j as 
a vehicle o i? com; iu ii cot i on between tvo individn. is, it my be argued llint. oil 
languages must make distinctions between niiostn is and non-quest i ons , between 
the person spoken to and tlic person speaking, w;ys of expressing tbe altitude of 
the one toward tlic other. The attitude toward t lie person spoken to my be ex* 
pressed by the type or length of the f o rn, c at egori zing people in di fie rent 
v;:tys(e . g. , f. tu, vom/ (». Pn , Sic). Pel nt i cash ip to the person spoken to r.iv 
also be conveyed in tlic assn: ipt ion ::nl use o r shaved syrbols ns reflected in 
slang and technical jargon. Similar processes i; ay be used in different 1 orgvru.es 
to reflect the attitude of tlic speaker to what lie is talking about. If the 
speaker is comm nl ing on the fact that somouc ivfuses to spend his savings to 
buy a new car, tlic dioi co of words like thrift lir avaiycc reflects the sper-kc r’s 
attitude If 1 aid the pcrrcn spoken about. IVic a*, in, the i;.y tkm dii.t * net cas 
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are r.ppj’cd ray be fnr fivu ccn.j si • at , lavy. y.v; rsc l hr- sere foj'M ior 

the person spoken about and to'- person ?. Con lo - (spec) ally a f .it is a polite 
fo rr. J/snyuaycs also have wv*s of oxpre'wT ny vhat is expected of the person 
spoken to - ( tmoif.s ion of orb r my hr. v-ed to illicit 0 statrn.nt froia him 

(Arc yoij?) , a spatial void (ifa* V) or sii ply a change 5 1j the tone of voice 

( i nt 0 r rog a l i vc i n t or 1 : t 1 ion). 

Khat can one conclude* from this into mi on into the cloudy heights of 
speculative linguistic?? It seems that laayuap. may have ft lot in cca.i.i e in 
what they can say, but lhai the)* may differ unexpectedly in the way tJif.y s ay it. 
Without actually an* lysing all 1 e> guares , t V» c- oily certain uni ve rs els are Likely 
lobe found in ;he make-up of \,mi - his body, p; rli cularly hi:* speech organs 
anti in the nature of the discourse he produces a sc<pvn?e of symbols in his 
necessary acts of communication. Of vhr.t vse c : :j these ml versa 3 s i »o in measur- 
ing the d i ,a * i'j'iCC bet >tv n 1 an j .e 0; ,« > ; 



K4 Univcrals in interlingual distance 

How can vc us: any of the universal!* of wind, body ami environment and 
vJiat is common in codes, discourse and cor, v Mini c; t i on to nersurc the differences 
between languages? 

If ve know wlr.it these uni versa! s arc, ve ceil u c c then as pm amrters , Porn 
instance, kc know all organs of r.-:n used in speech and ve know their physical 
limits. These can he used as constants in phonetic mensuren nl . Wc also know 
certain things that all men have in common - tine, space, ports of the body and 
ve can use the number of distinctions nub in each language to measure inter- 
lingual distance, between these two cxtieirs of language, which touch the 
physical world of sound and ivnniig lie a multitude of possible di ffcroiuvs i ii 

syslci j raid st 1 ticl ores . The difiVuM or, .Ml ic.-.I 01 . j ral i ( r* of Lingupo 
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happen to be universal. Not knowing this, we can cither assume that there arc 



fashion,, we can extract the constants found in the grammars of all languages. 

Meanwhile, these constants - if there arc any - of how languages can but 
do not differ may constitute a uniform error in all measurements of intergram- 
matical distance. For example, if both French and English have distinctions in 
their grammars between actor paid action, this common feature, if taken into 
account, reduces the distance between the two grammars. If, however, this dis- 
tinction turns out to be universal, the distinction has no longer any differ- v 
ential significance. 

Certain universal characteristics of man, thought and communication, .can 
therefore be used as measurements of interlingual distance. Those can be broken 
down into distinctive features very much, like a television signal is analyzed 
and synthesized to reproduce a pictuire on a screen. In this way, each phoneme 
can be broken down into distinctive' sound features, just like each concept cate- 
gory can be considered as being made up of a number of semantic components. 

Combinations, sequences and modification of these features may be permitted 
or obligatory, in both what must be expressed and the way in which it is. 
Combination of lip rounding and vowel fronting, as noted above, are permitted 
in French /yf, but not in English, where lip rounding makes tongue retraction 
obligatory /u /. ' Sequences of consonant and /-s/ are permitted in English 
( pillars, plural s); but not in Spanish ( pilares, plurales ) . These constraints 
are part of the significant differences between languages and would have to be 
taken into account directly or indirectly in any complete measure of language 
distance. 





i 



18 



16, 



In what way then do the universal characteristics of man and his language 





the types of measurement. Since all languages must have some code, discourse 



and communication features, it is these that will have to be taken into account 



in di vising types of ure men t. The distance between the codes will not bo 



the same as the distances in the discourse the}* produce; and interlingual 
distance in oral communication will not be the same as it is in written com- 



from their kinetic counterparts in the stream of speech. 

These universal distinctions of language relate in turn to the universal 
characteristics of man. The human body is the vehicle and producer of speed) 
and as such can be used to measure the distance between speed) codes. The 
universal characteristics of man's environment are relevant insofar ns they arc 
accounted for in the language codes. The universal characteristics of mm's 
mind, its symbolic processes , its. mechanisms for transforming code into discourse, 
and even its tolerance of inconsistency, will affect any attempt to measure the 
differences between languages. 

Between these abstract universals of the human mind and the concrete 
univexsals of the human body, lie the multifarious diversities of the languages 
of the world. The universals enable us to study the diversities by revealing 
all possible types of interlingual distance and by serving as instruments for 
measuring them. 



muni cation, since the static features on the printed page may differ in many ways 
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2.1 Distance in Code and Discourse 
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. TYPES OF INTERLINGUAL DISTANCE 

The universal distinctions between mind and body, code and discourse, 
speaking and writing, mother-tongue and other tongue and the *diversit y and 
intensity of man's conception of his environment oblige us to consider the 
measurement of interlingual distance from a. nuirbef of different viewpoints. 

These considerations make it evident that distances between languages can 
be measured from different angles and that there arc different types of inter- 
lingual distance. We can distinguish between: 1) distance in the two codes 

and distance in the discourse produced through them, ii) distance as communica- 
tion of static messages in space, as distinct from different! texts unrolling in 
time, iii.) distances as differences between languages, as opposed to distance 
as movement from mother-tongue to other tongue, iv) distance as an analytic or 
taxonomic accumulation of categorical differences, or distance as an over-all 
integrated measure, v) distance as a direct measure of difference, in contra- 
distinction to indirect measures making use of an intermediate language 5 
vi) distance as the diversity, intensity and productivity of the comparable 
differences between the languages. 

Since not all these different types are mutually exclusive, the distance 
between two languages may be measured from a hunber of different angles, as 
will be demonstrated below. But before elaborating techniques of measuring 
interlingual distance from these different angles, let us examine what the 
different types of distance involve. 

( A ) 

2.1 Distance in Code and Discourse: (A X), \X/ * 

As we have already seen, all languages have systems of symbols variously 
referred to as 1 an gue or code, and these axe used in conventional ways to pro- 
duce messages or texts - an activity which has been called by a number of names 
including parole and discourse. Measuring interlingual distance from the one 
is not likely to be the same as measuring from the other. 
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In measuring the distance between Language A and Language X therefore we can 

proceed either from the descriptions of the languages or from samples of them. 

; ,[ 

In other words,, we may measure the distance. (D). (I) in the; codes D(AX) .or ( 2 ) 

s 



in the messages , or samples o£ discourse DjA 



Measurements of the differences between two 



language ; codes depends on o\o 



accuracy and comparability of the descriptions used. If t 



10 infoimtion sup- 



plied is either incorrect or incomplete, for one or both languages, .the con- 
sequent comparison or contrast is likely to bo inaccurate h If the type or 

! . •• !• t 

methods of description (and notation) used are not identical, differences in 

descriptive techniques may appear as alleged differences in the languages, That 

•l . ‘ -'l . : : 

is why some of the contrastive studies which are! based on the two codes have 

■ i . 1 : ; ' 

tlieir source not in the use of ready-made descriptions but; on re- descriptions of 

— t f: 

the languages for purpose of coraparison - or contrast.. This is another reason 

: i . I ■ . • ■ 

why two contrastive descriptions of the same languages or dialects may differ, 



In measuring from discourse samples, however, sets of 



equivalent texts 



with alternative versions are.- needed. But even With nn inventory of all possible 



alternatives , it will be seen that measuring from the message is not the ; 
equivalent of measuring from the code. For one cannot assiinie that different 

codes will produce correspondingly different equivalent texts or that the degree 

T 'j. 

of difference between two equivalent texts corresponds to degrees of difference 

in the codes which generate them. This is evident when -one ! 5 travels from one 

r ! 

area to another area in which the same language is spoken. ;; Th ere are situations, 
for example, in which Englishmen say "Not at all”!, Canadians say "You 1 re welcome, rt 

i . | • 

and Americans say ,r Uh, uh. 1 ' Yet these three equivalent discourse responses to 
the same situation do not indicate that we have three different languages. 

Between two different languages, the different -codes may produce equivalent 
alternatives which are closer than others. For example, the English sentence 

1 i ; 

The -na me is Robert may be rendered with equal correctness in French as either 



Je m 1 appello Robert or Le nom ost , the latter being much closer to the 

English equivalent. This doc me m , however, that the two French sentences 

are interchangeable ; they are, on the contrary linked to mutually exclusive 
situations, the first, being used in introduction, the second in consultation. 
The English sentence is used in both; to do so in French would consequently 
constitute a case of interference. Measuring language differences through 
equivalent texts therefore is valid only where ’both texts operate within the 
same context. 

Sim laxities in code do not necessarily produce similarities in discourse. 
Mien measuring from the code, the English and German vocabularies may turn out 
to-be very close, as it is in the German sentence: Kann (can) man (one) ein 
(one) Bo ot (boat) habe n (have)? But measuring ’ from the English equivalent c>f 
this sentence, the two languages no longer seem so similar; 

G. Kann man ein Boot habe n?/ E. May we rent a boa t? 

Although such code similarities may facilitate comprehension of the other 
language, they may be a source of interference in expression. Tiie codes of the 
two languages may be close, while their respective transformations into 
equivalent messages may be highly divergent. 

Contrariwise, the identity of two equivalent messages does not mean 
that the codes producing them are likewise identical. Sentences like Maria va 
a casa m ay be read as either Spanish or Italian. But if one were to compare 
the two codes which produced this same sentence, the two languages would appear 
as different; Sp. (pres, inch); voy, vas , va, va mos , vuis , van./ Ttal. 

(p re s . in d . ) ; vado, vaj , va, andiamo, andatc , yanno. 
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Spanish 



Itali an 



Yo vov a cas a. 



Io vaclo a casa. 



• Mari a va a casa. 



Nosotros vamos a casa, 



Noi andiamo a casn. 



Teresa y .Maria van a casa. 



Teresa e Maria y anno a casa 




measured from their codes as when measured from their messages. We must conclude 
that neither measure is alone sufficient to give the complete picture; wc require 



distance between codes and distances in discourse, 

2.2 Static Distance and Kinetic Distance: D, _D 

Distance between languages will also be different depending on whether we 



the generation of texts from their respective codes. 

In static measurement (D), language is considered as if it were a reservoir 
of material to be matched and measured. Two languages may be treated as if 
they were two landscapes, each corresponding feature of which is given equal 
consideration, Language can also be regarded as a kinetic process (D) , the 
dynamic generation of texts on an ever-ending time scale. And the point on the 
scale at which elements of languages are equivalent can he regarded' as a function 
of tlieir difference. 

Since discourse is essentially sequential, it can be regarded as unrolling 
in time, so that at no given time is a whole text perceptible. In this way, 
the resemblance of what comes before makes us anticipate what may come after. And 
because of this, the matching elements that come first have more resemblance 
value than those that appear later. For the probability of one unit along the 




measure the static results of language communication ox the dynamic process of 
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time sequence being identified with its counterpart depends on the type and 
number of elements that have already appeared, 

. Since the sequences of phonemes appear, not in space but in time, the 
position of a phoneme in one of these temporal sequences can be weighted. If, 
instead of reading and translating the texts, one were to listen to them, as 
do interpreters, the similarity of elements at the beginning of words or word 
groups becomes more important that it does at the end. This importance is 
also reflected in the natural tendency to form oral abbreviations like math and 
gym - rather than ma t i cs and nasties . The operative factor, however, is not 
sound, but sequential position. Some computerised systems are able to operate ~ 
even with proper names - with no more than five letters per word - provided they 
start counting from the first letter. It is almost as if a series of pictures 
were to appear in sequence, giving us more and more information as the film 
unrolls, in this sense, language is essentially kinetic, and if all its 
characteristics are to be measured, this must be taken into account. 

2 . 3 Distance as P i fference and Conversion : D (7r^X) , D 

We may regard the distance between languages either as the sum of their 
differences or as the amount of change needed to convert one language to the 
other. 

In adding the differences between two languages, one may conceive them as 
opposing forces, or vectors, which can be combined as a measure of divergence 
(differential or vectorial distance). Or one can regard these differences as 
the amount of work or number of operations necessary to convert one language into 
the other (conversion distance). The first, being a combination of forces, is 
non-di recti onal ; the second, being a transfer of operations, is uni-directional. 
Let us examine what the measurement of each involves. 
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2.3.1 Pi f fe ren t i al ' P i n t mice : D (A^ X) 

The measurement of the differential distance between Language A and 
Language X, being non-clirectional, is obtained by the summation of differences 
between both languages y so that D(A^X)~ A - X f X - A. For example, in 



measuring the difference in the system of determiners in French and its counter- 
part in English, the fact, that the. single indefinite article (a) is equivalent 
to the two of French (un , une).i.s tlie same amount of difference as the fact 
that the person determiner in English ( his] is equivalent to two forms in French 
( son , sa) 3 a difference of one in each case. To say that the English (Iris pier] = 
French (son , so) and count a distance of zero would be to ignore the fact that. 

(her sr son , sa ) - (1-2) and (h is = sen , sii) ~ (1-2) , and ( son - hi s , her) - (2-1), 
(sa = hi s 3 her) - (2-1). So that the distance is not zero, but rather, in 
natural numbers: (1-2) + (1-2) + (2-1) * (2-1) = 4, that is, a vectorial distance 



of 4. The vectorial distance bet 






s of singular determiners in 



English and their French equivalent will then be a total of 11, calculated as 
follows: 

E. a tlie this + that my your his ■ + her 

i ^ " i i \ i 

F. un+une le+la cc+cette ce+cette , cc+cette inon+ma ton+ta*votrs son+so sen+sa son+sa 



1 1 12 1 
2 2 2 1 2 
1 + 1 + 1 + 1 + 1 



1 

2 

1 



1 12 1 

1 ill 

2 + 1 + 1 + 1 = 11 



In other words, if we add the differences, between English and French, in the 
distinctions recognized in this series of determiners with those between French 
and English, we get a total of 11. If we were to add all such differences in all 
the semantic and formal elements of these two languages wc would get an idea of 
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how far apart they are . But we would not know how many more distinci.ti.oris 
'would have to he made by a Frenchman learning English in comparison with those 
which an Englishman would have to make if he were learning French. Mere the 
starting point would make a difference, since the distance between A and X 
would not be equivalent to the distance between X and A. he would have to know 
which language features are being converted into which. 

2.3,2 Conver s ion Dist ance : D (A X ) 

It may make a practical difference whether a learner of a. second language 
starts off with a native language that makes a large number of compulsory 
distinctions and tries to master one that does not, or whether he is going from 
a simple language to a more complex one. In order. to- find out this practical 
difference, differential measures would not be sufficient. What is needed is an 
idea of how many extra distinctions have to be made when passing or converting 
from one specific language (Language A) to another (Language X), in other words, 
■we would have to measure the conversion distance. 

The measurement of conversion distance, being uni-clirectional, is obtained 
by computing the number of distinctions not present in the other language. 

In the above example of the set of singula]- determiners in English and 
French, it will be seen that for every English determiner there are at least 
two French ones, one masculine and the other feminine; so that when converting 
from English into French, there is additional distinction of gender each time. 

In addition to this, there is, in one 'determiner (the second person singular) an 
extra social distinction to be made betv/een the general and the familiar forms. 
The conversion distance is the sum of these distinctions that do not exist in 
the source language; difference between the language to be attained (the target 
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language) (T) and the language already acquired (the departure or source language) 



(S) . So that, S T = 


* T - 


S = A - X. 


Compare: 










A (Target) : P. un+une : 


i e-i-1 a 


ce+cette 


ce+cette 


mon+ma 


t on +ta+ votre 


son +5 a 


s on-ms a 


2 


2 


2 


2 


2 


3 


2 


2 


1 


1 


1 


1 


1 


1 


1 


1 


X (Source) : E. a 


the 


this 


th at 


my 


your 


his 


her 


= i + 


1 


+ 1 


+ 1 


+ 1 


+ 2 + 


1 


+ i =< 


0 (sg.det) (E 


F) 


= 9 













These represent distinctions of only two types, one (masculine -feminine) with the 
highest productivity, covering all or almost: all determiners and nouns, and the 
other ( ton-ta , vot re) limited to the second person singular determiner and pronoun. 

Measurement in the other direction, however, gives quite a different picture. 
The French speaker learning English h as to make only a few extra distinctions 
he does not make in his French determiners , but these axe in different semantic 



areas. So that, F. ** E. = 


H. - F. : 








A(Target) E. a the 


this + that 


my 


your 


his + her 


X(Source) F. un le 


ce 


mon 


ton 


son 


uno la 


cettc 


ma 


ta 


sa 


1 1 


2 


] 


1 


2 


- 1 1 


] 


1 


1 


1 


E. - F. = 0+0 


+ 1 + 


0 


1 0 


+ 1=2 


D (sg.det) 


CM 

11 

SU* 








A French learner of English would have to 


make two 


extra 


distinctions which he 



does not make in French, and each of a different type, the first having to do 
with proximity and the second with sex. Both are high intensity items, however, 
because of their frequency of occurrence but since they apply to small classes of 
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words, their item productivity in the code is lower (se e be low*). 



2. A Taxo nomic Distanc e and Integr al Di stance: )!>. D 

One can measure the distance between languages either by taking the dif- 
ferences category by category, that is, taxonomically , or by attempting to 
abstract from them integral measures in order to obtain an over-all picture. 

In the taxonomic measures we have to deal with classes, categories or 
dimensions. Each lias to be considered separately and measured by a procedure 
appropriate to the dimension being analysed. For example, differences i7i 
syntactic relationship such as the concord between adjective and noun, may not 
be measured in the same way as the difference in class membership where the item 
may be an adjective in one language and a noun in the other. 

In the integral measurement of interlingual distance, only the equivalents 
as a whole, and only as they meet the eye or the ear, are measured for their 
ob s e rvab le di. f £e ren cos . 

In the first case, the results of the measurement appear as a sequence of 
figures cadi representing the distance in the category or dimension included. 

In the second, the distance appears as a single integrated figure. 



2.5 Semantic Distance and Formal Distance: D^ v ^ , D(a-X) 

Studies of language have kept separate the description of what is said from 
that of how it is said. The dichotomy has variously been called semantic vs. 
formal and content vs. expression. Without denying that all language features, 
even those of meaning, are formalized in the code, let us follow the conventional 
usage by calling what is said, the content form, ’semantic', and how it is said, 
the expression form, ’formal’ , for want of a better term. 

It will be seen that formal differences between languages are not equivalent 
to semantic differences. A look at a few deceptive cognates will make this 
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clear.. For example, the form location i s the same in written French as it is 



in written English. In the semantic code of French, however, it does not 
cover the same area of meaning as it does in English, since it would sometimes 
have to be rendered in that language ns renta ls , t.o let , and other forms. An 
examination of the semantic equivalents in English and French of such words as 
a driiinistratio n, in format ion , r e Inti on , trails mi ssx o n , page , machine - to name 
only these - would reveal the same distinction between formal identity and 
semantic divergence. 

The distinction, therefore, maybe an important one, since the grouping of 
concept categories is not necessarily connected with the relation between word 
forms. The fact, for example, that languages group words into masculine and 
feminine does not signify that the difference has anything to do with the meanings 
of these words. 

In comparing language codes, therefore, the distinction between semantic 
distance ^ and formal distance will have to be maintained. In 

measuring the distance in discourse, however, the distinction no longer hold; , 
since the comparable tcxt.s being equivalent, the semantic component is a constant. 

2 . 6 Direct Pis t an ce and Indirect Distance 

Interlingual distance may be direct or indirect. If the difference between 
two languages or dialects can be measured directly, there may seem to be no 
point in proceeding otherwise. 

In certain studies , however, where interlingual measurement may be necessary, 
the only significant differences may be indirect ones. For example, if we wish 
to measure the difference in standard of the two languages in a bilingual com- 
munity, we will not find the answers by comparing the one with the other, but 
rather by measuring each against its standard regional or national forms. 
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Or in the study of two mixed languages with a common parent, we can find 
out by indirect measurement the extent of divergence of each and the results 
of the different types of mixture. 

Measurement of interlingual distance may therefore be done either directly 
(D = A - X) by comparing one language with the other, or indirectly by using an 
intermediary (J) language or dialect, as is sometimes necessary in practice: 

D = (A- J) - (X-J) . 

2 . 7 Diversity, Intensity and Pro d uctivity 

In attempting to measure the distance between languages, we may wish to 
know, not only how different the languages are, but also how important are the 
differences, both to their users and to the workings of the languages themselves. 

The distance between any two comparable sets of features in two language;; 
may vary in three respects - in diversity, in intensity and in productivity. 
Diversity has to do with the number of distinctions a language makes in its 
grammar, its vocabulary audits phonetics. Intensity is a measure of the im- 
portance of such distinctions. Productivity shews the number of areas (situations, 
structures and categories) to which the distinctions apply. Distance between 
languages can be measured as the sum of the differences in diversity, intensity 
and productivity of their comparable elements. 

Differences in diversity can be measured by subtracting the number of con- 
stituents of one language from their counterparts in the other. For example, 
in the semantic area of the third person singual pronouns, the difference between 
English and French is 1: ( lie, she , it) - (11 , elle ) = 3-2=1. 

Differences in intensity can be measured by the use of the appropriate 
scale of importance , depending on whether the comparable units are discourse 
features like the .intcrconnectives ( although , since , if , ...) logical abstracts 
(bo , seem. . . . ) or environmental variables (wine, snow) . Appropriate scales 
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may be found in the indices of frequency, range, coverage, availability and 
familiarity. 

Productivity has to do with the degree of generality at which the distinc- 
tion is actually applied. Does it affect a large class of elements (like all 
sentences, or all nouns), or only a small class? Within the class, does it 
affect all members or only a few (as do the foot- feet , goose- gee se plurals in 
English)? French* for example, makes action on self vs. action outside self 
a compulsory distinction, whereas English permits a choice (P. Je me lave./ 

E. I wash , I wash myself ) . This potential of difference in compulsory distinc- 
tions can be multiplied by the number of items so affected. Differences in 
productivity can also be regarded as the number of structures or units into which 
an element can fit. For example, in the formation of plurals by vowel mutation 
(e.g. E. m ouse -mi . co/ G. Mans -Man se) , Gorman is more productive than is English, 
where the plurals are usually in - s. In this respect, English is closer to 
Spanish than it is to its Germanic cousin. Productivity differences* therefore, 
can be measured by counting the number of items in each language affected by the 
difference, and by calculating the number of possible utterenc.es which the 
distinctions may produce . 

In sum, in measuring the extent to which different languages differ, we must 
distinguish between what we are measuring and where we are measuring it. What 
we arc measuring may be the differences themselves, or the amount of work needed 
to change one language into another (differential and conversion distances). 

Where we do the measuring may involve us either in comparing code items or in 
analysing samples of speech (code and discourse distances) . The techniques of 
measurement used will depend on the types of distances measured. 
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DISTANCE BETWEEN CODES 



3. 1 Semantic Differences 

3.1.1 Uni vers als of Meaning 

3. 1 * 2 Semantic Measurement 

3.1.2.1 Grammati cal Meaning 

3 . 1 . 2 . 2 Le xi ca 3. Me anin g 



3.2 Formal Hi £ fe xen cc r> 

3.2.1 Uni vers als of Form 

3.2.2 Formal Mens uremen t 

3. 2. 2,1 Grammatical Form 
3. 2 . 2, 2 Lcxi cal Form 
3. 2,2,3 Phonetic Form 

a) Prosodic Distance 

b) AJlophonic Distance 



3. 3 Limitations of Intcrcodal Measurement 
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In measuring the differences between the ciocles of two languages it may be 
useful to separate the semantic from the formal, since each poses problems of 
a different nature. 

\ 

3. 1 Seman tic Differences 

Before attempting to measure the semantic differences between languages, 
it is important to consider the problems of analysis in the vocabulary and grammar 
of the languages in relation to possible semantic universals. 

3.1.1 Un j. ve r s a ls o f Me an in g 

The distinctions which a language makes are not limited to gram- 
matical categories. The concept categories of languages are so varied and over- 
lapping that it is difficult to divide them into grammatical meanings and lexical 
meanings. One can imagine the possibility of arranging them into scales of 
inclusiveness in each of the areas of logical environment, including the naming 
of things and relationships, actions and states- of being that all languages must 
presumably have. Communication, a compromise between a desire to include and 
the need to exclude is also a consideration in the evolution of all codes. For 
all codes must make some sort of compromise between accuracy, efficiency and 
redundancy. Language codes have ways of taking the audience into account with 
register terms and intonation patterns for the transmission of semantic overtones. 
Each language must also adapt to changes in the environment with which it has 
to deal. It does this through a process of variation of its categories by 
analogy ( hands of a clock) and combinati ons (G. Stechnadc.1) and collocations 
( as a matter of fact , on a ccount of) . 

Through means that it may have at its disposal, each language must be 
able to label objects (parts of the body, food, clothing, shelter) properties of 

O A 
O 



o 

ERIC 



32 



I 



objects (number, color., sound, shape, odor, taste], notions (eating, sleeping, 
walking), relationship (in, on, at, by* for) , operations (identification of 
self and others, questioning, answering). 

. Also: by the above processes of analogy and coubi.naticn , each lan gunge in 
Its own wny - and often in seemingly arbitrary fashion - associates certain 
meanings (semantic features) to one or the other of these labels. So that the 
same group of features (e.g. thunder) may be associated in one- language with the 
category of things, in another language with the category of actions, and in 
still another language with the category of qualities. 

Keeping in mind the fact that discovery of features that all languages do 
(not must) have may well in crease any measured distance between languages, can 
we still proceed to measuring relative distances in the area of logico-graimnaticnl 



categories where such universal s are most likely to be: found? If one were to 
arrange the semantic distinctions into re tap) there is likely to bo a continual 
overlap of sets, and ii) what is more inclusive in one language (A) may be less 
inclusive in the other language (X). For example, French does not distinguish 
between persons and things in its pronoun system (ij , olio) whereas English does 
(he +shc, it) . But in the area of questions in French ( qua , q uo!) and in English 
( wh o , what) both languages make the distinction. In relative connectives, 
however, only English makes it (13. The person who . , 4 . The thing whic h. 

F. La personne qnl . . . . La chose qui ) 

When we consider the differences in the distinctions which languages make, 
tile arbitrariness in which they group them, and the indeterminancy of the cate- 
gories they create, how can we attempt to measure the semantic differences 
between languages ? 
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3.1.2 Semantic Measurement: 0 V * * KJ 

Although there have been attempts to study the measurement of wan- 
ing, semantic measurement , even for n single Jan gunge, poses some formidable 
difficulties. To begin with, there is still no agree me nt on the meraiing of 
meaning. To a psychologist, it may in 3 an what one feels about a word; to 0 
philosopher, it may mean what one thinks about* it. Psychologists have attempted 
to measure differences in affective meaning by comparing the patterns of word 
association derived from testing different samples of a population; the re- 
sponses have been arranged i:t such a way ns to bring out what is known as the 
semantic differential. Logicians, on the other hand, have tried to reduce 
mo an i n g 1 0 1 ogi c a 1 cat c go ri c s . 

Vdii lc recognising the legitimacy and usefulness of such studies, one 
must, in comparing different codes, consider the linguistic rattier than the 
affective or logical differences in moaning, and try to propose ways by which 
measurements might he elaborated. Ami although it is difficult: to draw a line 
between lexical and grammatical meaning, it is necessary to begin by recognizing 
these conventional and convenient distinctions, since they suggest different 
app roadies to the problem of measurement. 

3 . 3. . 2 . 1 Grammnt i cal Me an \ n g 

Mien a speaker puts semantic categories together to produce 
an utterance, lie may have to take other semantic features into account - and 
these may vary from language to language. The concepts indicated by the word 
tab le , for example, may cover just about as much ground in English as it does 
in French. Hut when it is referred to a second time with a substitute word 
( it/ollo) , the linglish speaker must make a compulsory semantic distinction point- 
ing it out as a thing; whereas the French speaker makes only a formal distinction. 
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repeating the feminine classifications of the article (la., line) , It is ns 
a newcomer into a strange land had to learn not only that the land is inhabited 
by different categories of persons, recognized either (semantically) by what 
they do, or (formally) by how they look or dress. He most also learn whom to 
ask for what. 

Mow can one measure this semantic compulsion to which varying languages 
submit their speakers? The problem is to identify the compulsory distinction 
tli at must be made in each language, Some of these are obvious, like the dis- 
tinction between positive and negative in English; others arc less so, as is the 
progressive in English and the subjunctive in French. It is important to keep 
in mind that wc are here treating only the semantic features - grammatical mean- 
ing, as it were, and not grammatical form. The distinction is important, since 
apparently equivalent features may be semantic in one language and formal in the 
other - and the fact that: they are, is likely to increase the distance between 
the languages. For example, both English and French have possessive determiners 
(E. h is, he r/ F, s on, sa) ; but in English the masculine- feminine distinction is 
semantic; in French it is fonoal, agreeing, not with the owner blit with tlie form- 
class (gender) of the word with which it is used, - son jardln/ his garden, he r 
garden ; sa m arson / his house, he r house. In other words, English, unlike, most 
Indo-European languages - with the possible exception of Modem Persian and 
Armenian - lias a natural gender (semantic), while French - like most languages 
in the family - has a formal gender, But formal gender is likely to vary from 
language to language, making the sun masculine in French ( 3c solei l) and feminine 
in Gorman ( die Sonne ) and the moon feminine in French ( la juno ) and masculine in 
German ( dor Mon d) , And tlie classification is likely to seem unreasonable to the 
stranger. Why should a soldier in French (lc so ldnt ) become feminine when he 
acts as a scntinal (la s entinel le) ? And why should young. girls and old women 
neuter in German, (das Mad chon , das Wcib) ? 
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It is perhaps possible to divide the compulsory semantic restraints of 
grammar into categories related to grammatical function - such as identification -■ and 
to predication. The components of these categories, however, may remain incom- 
plete^ since theoretically any language can contain any obligatory type of 
semantic distinction. That is the significance Off the slot labeled 'other* 

(s ee Table 2) , As a category of predication, for example, 1 other* could represent 
the distinction between verbs of activity and those of displacement ~ compulsory 
in French, but not in English. Compare : 

E. At homo, he walks. / F. Oioz-lud. ,il marche. 

K. He walks home. / F. II va chcz-lui a pied. 

In the categories of identification and function, there are. also other possi- 
bilities that may bo difficult to predict a priori.. The identification of 
positive, as distinct from negative, partitives is found in English but not in 
French (13, some- any / F. de ) . 

Once a complete inventory of these distinctions is obtained (and we do not 
yet have such an inventory) we can use the presence or absence of such compulsory 
forces as vectors in measuring the difference between two languages (See Table 2). 
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UNIVERSAL CATEGORIES OF LANGUAGE CODES' 



There are some disadvantages irj the use of a - general framework of possible 
grammatical distinctions. Firsts there may be difficulty in making sure that 
the framework contains all possible distinctions, ‘Secondly, some languages make 
a number of distinctions at the same time; and one is never sure which and how 
many are intended. There are, for example, about forty possible semantic dis- 
tinctions of case, that is, some forty ways in which one thing ( 03 ' nominal 
concept) may be related to another. In Latin, the nominative case covers one, 
the accusative, two and the dative four. 



Instead of proceeding from a check- 1 
tions in the grammar of languages, it is 
procedure of comparing the two grammatica 
view. In doing so, however, one must be 
function on the one hand, nor constraints 



1st of all possible semantic distinc- 
rossible to adopt the less ambitious 
.1 systems from the semantic point of 
careful not to confuse form with 
with alternatives on the other, A 



single form as. we hove seen may express se veral relations of different types - 
person, number, case, tense, mood, voice. For exarple, the - £ in Lat. amo 
(I love) expresses the present indicative of the first person singular of am re ; 
the single form -a tur i-n amntur (He, she or it is loved) expresses distinction 
of voice, person and tense. 

A constraint cannot be treated simply as an. alternative. Take the forms 
of the French verb as shown in the words a fini f: nit and finissait . There 
may sometimes be three ways of viewing past time; but only two may constitute a 
semantic constraint, since the speaker of French may ignore the distinction 
expressed by the simple past and get along quite well with the perfect and 
imperfect tenses. 

Keeping these precautions in mind, therefore, wc can isolate from the two 
languages the cases where more semantic distinctions have to be made in the 





grammatical forms of one language than in the corresponding forms eff the 
others This is equivalent to the measuremenl: of the conversion distances in 
the grammars of both languages - measuring in both directions, the difference 
beijig D - (A->X) + (X -•$> A) , From this point of view, then, let. us 

take a look at the verbal systems in English and French. (See Table 3.) 
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TABLE 3 

SEMANTIC DIFFERENCES IN ENGLISH AND FRENCH VERBAL SYSTEMS 
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Not all those differences , however, can have the same value within the 
respective codes. Regardless of frequency of usage, the degree of compulsion 
is not the same. One could argue that one of the differences in the codes 
between English and French is that the former has maintained the Indo-European 
dual in its category of number; but the forms in which it is found, although 
frequent, are residual. All languages must have such residual forms, since all 



living languages evolve. These forms will come out in the measurement and will 
have to be weighted for their productivity. It is true that the Indo-European 
number category did have a dual - for example, in Sanskrit (Sk. l^riu) - as well 
as a singular (Sk . sa) and a plural (Sk, to) . Although this dual lias disappeared 
in French, it remains ns a residual form in English. Compare: 



You can't take bot h , On ne pout pas prendre les deux , 

but you can pick the better;- mais on pent choisir lc meillcur. 



You can't trike all, On ne pout pas tout prendre, 

but you can pick the best. mais on pout choisir le mol 1 leur . 

In the last analysis, grammar is simply a means for speakers to establish relations 
between. meaningful elements of their discourse. What is coded as grammar, and 
what is not, cannot always be an either/or proposition. A grammar of English, 
for example, will vary according to how one classifies the words get and keep. 

If they are considered like have and can , as having grammatical functions, then 
we would have to add two more aspects to the system of the English verb, since 
these function words can combine with perfective (-ed) and imperfcctivc (- in g) 
grammatical forms to produce what could be called grammatical meaning, which 
would have to be rendered in French by differences in mood, voice, or vocabulary. 
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Compare : 



reive 



Purative 



Perfect 


Iwperfect 


Perfect 


I perfect 


S. get married 


get going 


keep married 


keep going 


P. se mordent 


se mot tent on 


restent mnrics 


continuent 


route . 



Since many semantic grammatical categories exist mostly in the minds of 
the categorising grammarians , it seems advisable in measuring the semantic 
distance between languages not to rely entirely upon them. It is necessary 
to sec how many of the differences can also be covered by concept categories 
considered as vocabulary. 

3 . 1 . 2 . 2 Lor i c a I Me ai ) in g 

Since measurement is not description, available inventories of 
semantic features may be useful in establishing the indices of difference. 

Many of these inventories - even the most abstract cmd grammatical - can be 
found in certain unilingual and bilingual dictionaries - imperfect as they may 
be - supplemented by the descriptive and differential grammars of the two 



languages. 

Because the groupings of distinctions of one language may overlap 
a number of groupings (categories) of the other language, and vice versa, it is 
necessary to measure the differences from both viewpoints: (A~X) = (A ~>X) + 

(X A) . And since even the most complete dictionaries and grammars differ on 
what is category of meaning and what is usage, in referring to this valuable 
source for the measurement of semantic distance, it will be necessary t.o include 
both. 
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Fox example , if one takes the concept of man - which may perhaps be 
universal - and see what two languages, like English and French, group into it, 
one soon 7iotices that although the groupings have a lot in common, there are 
remarkable divergences* The concept (man) does not covgv the same number of 
areas of meaning, and in each area it does not cover the. same specific uses . 

For each concept, the degree of congruence can be used as a measure of close- 
ness or distance. 

Since an inventory of this concept and its uses may be found in a modem 
bilingual dictionary, v?c shall use the same dictionary (e.g, 1 1 arra p 1 s hn gl i 

French / Fron ch - En g lish Dictionary , Rev. cd. 1967) to measure the differences. 
Following the categories of the dictionary, v.c find that it recognized six 
categories for man (including one sub category) (A~6) and four for homn ie, (X ~ *1) , 
making a total of ten. Only two of these categories (mankind, and adult male) 
arc identical, (A - X) 2. , giving an overlap of 20%. 

Not eacli category, however, h ns the same number of usages or ’meanings 1 , 
and in these there is also a possibility of overlapping. It is therefore wise 
to count in both directions, i.c., the crises in which man - horame and 
homme ~ man. For example, although the category adult male ( man / i to mine) exists 
in both languages, this does not man that usages within the categories are 
always equivalent. In some cases they arc ( I’m your man / Je suis votre homme) 
and ( Soyct un homme / Be a man ) ; in other cases they are not ( Good man ! /Bravo!) 
and ( Homme a f c mme / L ady killer) , If we add together these cases in which 
languages arc not equivalent, we get another measure of difference. 

In the six categories of man (translating each usage of English into its 
French equivalent) we have 15 cases ( a = x) of equivalence and 42 cases (a /x ) 
of non-equivalence in a total of 57 usages listed. Contrariwise, in the four 
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categories of h o mao , this time translating from French into English (F/F) > we 
find 12 cases (x ~ a) of equivalence as against seven (a r x) of non-equivalence 
(F/F) out of a total of 19, If we add all these differences and express them 



as a percentage- of the total number ’of categories and constituents, we get a 
semantic distance of .62, by applying the 'foil wing weighted formula; 




Since dictionaries and grammars, however, are loaded with low- frequency, 
unprodu.ct.ive, rare and obsolescent forms, it seems necessary to select those 
concepts which represent that part of the code actually in use, or the more 
general, inti. versa! or usual concepts. By way oi example, we have selected ten 
general concepts and have arrived at some sample measurements of the distance 
between them in English and French (S ee Tabic d .) » ,p 

Identity of meaning, however, does not indicate identity of form no. more 
than the degree of semantic difference indicates a corresponding degree of 
formal difference. 
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TABLE A 



SAMPLE SEMANTIC DISTANCES BETOKEN ENGLISH P, FRENG1 LEXICAL CODES 



man 


/ homme 


62.0% 


time 


/ temps 


73.4% 


way 


/ fa con 


89.9% 


state 


/ e tat 


*7 V V<> 
/ O . «_>0 


world 


/ monde 


72 . 7% 


life 


/ vie 


65 - 9% 


come 


/ venir 


75 . 4% 


sec 


/ voir 


65 . 4% 


1 j i pj i 


/ haut 


-1 r r*. 

/ J . U V 


house 


/ maison 


74.6% 



Semantic distance of ten lexical itevns in English and 
French calculated by percentage of two-way differences 
in concept coverage of an English word and its French 
dictionary equivalent. 
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3.2 Fo n un 1 lilf foTtnccs 

The formal features of language are more evident than the meanings they 
convoy, the basic, needs of coroiaunicotion and the limits of the human speech 
organs combine to place the formal features of languages within certain bounder! e 
Let us discuss these universal formal features before elaborating ways of 
iieasuring th^oir differences . 

3,2.3. Un :i.ve vp nls o Po rm 

The needs of communication require a limit to the length of an 

utterance necessary to convey a conventional semantic feature - and the more 

frequent or usual the feature, the shorter it is likely to be. But there is 

also a limit to the shortness, since another basic need of communication is that 

the messages reach their destination (the listerner's mind) intact despite the 
/ 

distortions of lumi.au and therefore imperfect production and the transmission 
losses due to noise and distance, so that if one feature is lost another can 
take its place. In other words, there must be a number of redundant features in 
any language system. All systems of all natural languages seem designed to 
produce redundancy.. And this redundancy is often constituted in the "useless 1 ' 
constraints which languages oblige their speakers to use. They appear in the 
grammar pf. languages as rules which seem pointless find arbitrary to the learners, 
rules like the use of both preposition and case ending to convoy the same idea - 
(G, i n seine]]) Bcru f - trotz seines Bcr uis; L. a -mar i. - ad mare ) - but which 
must be obeyed. 

In order to be able to create utterances, all languages must have some 
sort of elements - such as words and speech sounds, which can be said to form 
part of the code of the language. All languages in order to become means of 
communication, must make some sort of distinction between, these elements in the 



code and the utterances which they can produce. Otherwise a language would 
be simply a collection of ready-made utterances, quite incapable of dealing 
with the complex needs of man. The potential number of utterances must there- 
fore be greater than the number of elements in the code of the language. In 



to produce different utterances. For example, the utterance ‘dog bites man ‘ 



each structure containing types of elements in certain positions, forming the 
units of the structure. 

These units arc the immediate divisions of the structure into its components. In 
some languages, clauses are the immediate divisions or units of the sentence 
structure, phrases the units of clause structure, words the units of phrase 
structure, etc. In this way, some structures are more inclusive than others, 
and this inclusiveness forms a hierarchy of inclusion which all languages contain, 
since the minimal operational number seems to be three, e.g. , sentence, word, 
sound. 

The necessary difference between code and message supposes that a structure 
must bo able to produce a number of different utterances by using different 
combinations of units (dog bites man, dog bites bear, bear bites man, bear kills 
man, etc.). The more different utterances produced, the greater the productivity 
of the structure* This depends on the number of components of the unit which can 
occupy the same position, that is, it depends on the depth of the ’well* into 
which the structure can dip, as it were, to produce new combinations by filling 
its blanks. If the position is occupied by a determiner, ( this , that , the, his , 
her, etc.), for example, the depth will be less than if occupied by an adjective 





different ways in which the elements combine are tire structures of the language. 
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( gocd , bad* tall , blue, etc.)- The depth of a class of units J n one language 
is not always the same as it is in another .Final ly, length , class and relationship 

of the units obviously vary from language to language. 

To sum up the universal formal characteristics of languages, therefore, 

we can state the following: 

i) All languages contain elements. 

ii) These elements comprise units and structures. 

iii) Some of these are more inclusive than other, making 
, all languages essential ly hierarchical. 

iv) The units appear in the structures in a certain order. 



O 




v) Each unit has some length. 

vi) Each class of unit has one or more members, giving the 
class depth. 

vii) Each unit is in some way connected with other units, giving 
it a set of possible relationships. 

The formal elements of a language ai:e not necessarily determined by the 
way the language categorizes reality. The fact, for example., that two languages 
distinguish persons from things does not moan that they use the same sort of 
formal elements to do so. One may do it through its determiners; the other 
through formal non-classes or word endings. The semantic potential of a 
language - its capacity to express and combine categories in various levels of 
meaning (logical, grammatical, environmental, interpersonal and intratextual) 
is one thing; its formal potential is quite another. 

The formal potential of a language is revealed in the means or choices it 
has to generate the sort of meaningful messages of which its semantic system is 
capable. These choices operate both vertically and horizontally, as it were, 
selecting the appropriate structure down the hierarchy while bringing in the 
appropriate units to combine into a linear sequence which is t arms formed by 
the speaker into an utterance. It may be that the higher the level in the 
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ah st ract choi ces 



hierarchy, the less the difference between languages; the most 
may be universal. But at: the bottom of the ladder, at the level 



of the sequence 



of sounds constituting the utterance, the cliff 
easily be seen and heard. They constitute the 
whore all languages must again find an outlet 



'erences between Inn images 



can 



other end of the continuum 
in the physical world. 



3.2.2 Formal M e asure me nt 

The formal differences between language, codes con bo measured 
through elements that are comparable. A word or form in one language can be 
compared with a word or form in another language, a sound in one language can 
be distinguished from a sound in another language. It is therefore possible to 
separate grammatical form from lexical form and phonetic form. 

3.2.2. 1 Grammatical Form 



Although the vocabulary of two languages may be 
highly similar in form and meanings, the way it is put into operation may 
require quite different activities in one language than it does in the other. 
Although the form p olic e looks the same and means much the same thing in one 
of its usages in French as it does in English, .once the English speaker decides 
to use this word in a French sentence, he has to take into account the grammatical 
differences between the two languages, not only grammatical meaning, classifying the 
notion as singular rather than plural, but formal aa well, making them (or it) 
feminine instead of masculine ( la police ) ; and he lias to maintain this formal 
distinction whenever he adds an adjective ( la pol ic e italienne) . It is as if 
our newcomer in a strange land sees familiar objects related to many unfamiliar 
persons, and lias to decide between tin* persons; so that he is forced to make a 
choice between types and relationships he did not know existed. He must at the 
same time learn what they are, what they contain and what, to do about them. 
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le 



die z -el le 



police ? 



it alien 



va 



la 



italienne 



chez-lui 



vont 



If liis language is not too different, ho may be 



familiar with the type of distinction, without knowing 1*3 ~ distinction itself. 
For example, if the stranger to French were Italian rather than English, the 
word polic e might not look quite so familiar to him, since the Italian version 
is a bit different ( poll z in) ; but lie will be familiar with the process of 
formal word gender. What he will have to learn is the specific gender of the 
French equivalents. And even some of the close cognates may mislead him. In 
this case, ho is likely to be right three out of four times and wrong one out 
of four times, since the difference in the gender of Italian- French cognates has 
been estimated as about 25%. The English speaker, on the other hand, lias only 
a fifty-fifty change of getting the gender of his cognates of French. But lie 
first has to get used both to the idea of formal word gender and also learn 
which word is which gender. This leads to the idea of level or hierarchy in 
distinguishing formal grammatical categories. And it suggests that it is unwise 
to add a distinction at one level with a distinction at another until levels 
have been normalized in scale. 



possible, like the actor-action distinction, and by a continual sub-classifica- 
tion process, work down to the degree of differentiation in word- forms, we can 
keep the levels separate and then average the differences. 

Our inquiry would go something like this: Are there 

formal distinctions between actor and action and goal? What are they? Jn the 



If we start with the most general formal distinctions 
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actor part, of the utterance, what formal distinctions exist? In each of these 
(e.g. nouns) wluit formal distinctions are made (e.g. gender)? How a-'e these 
distinctions made (e.g. affixation , relation words, word- order) ? 4*1 f: at re i ac.i or s 
are formally marked (e.g. determine r-noun) ? Hov; are they marked (e.g. agree- 
ment in form)? What is the extent of the relationship (e.g. 24 forms of 
de te rutin at i ves in French as against 12 in English)? What is the degree of 
formal relationship (e.g. the common th- element in t his -t hat and th ese -thos e) ? 
Let us sec how answers to such questions would produce quantifiable results. 

( See Ta ble 5.) 

These differences ir. the forms of function-word 
groups, after having been thus measured intralingidsticaliy , are then measured 
interlinguistically > that is, against their corresponding opposites in the 
other language - the against lc, JU?* and los , their against lour, etc. 

These differences, however, can bo measured at the lexical level., along with the 
vocabulary of the languages, showing distance in lexical form. 
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TABLE 5 



SAMPLE QUICK UST OP FORMAL DIFFERENCES IK KOMINALS 




B 

P 

•H 

8 

o 



LO 

3 

g 



Pi 
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•H 

CJ 
R 
■p 
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o 
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R 
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R 

p 
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X 

R 
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O 
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R 

P 

o 
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g‘ 

ir 



to 
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*x 

CM 



fX 

CM 



Pm 



LO 

P< 

CM 
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p~f 


R 
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R 
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R 
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rP 


to 


to 
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o 



o 

to 

o 

rP 
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0 
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■H 
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rP 

+J 
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+4 

0 

45 

+J 

0 

X 
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H 

to 

rx 
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to 
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<0 
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q< 
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to 

IH 
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E 
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to 
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CM 

4h 
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CM 

Fi 

CO 

CM 

p, 
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E 
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4t 
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to 

S 
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to 

R 



H 

P 
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10 

O 

> 
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O 



g 

to 

to 

G 

to 

P 

to 

p 

o 

to 

to 

0 
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Pi 
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p 

to 
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0 

■M 

+J 

0 

0 

0 

0 

0 

+J 

R 

0 

0 

0 
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CM 

II 



to 



to 



to 



ai 



0 

rH 



+ 

t**t 

+ 

H 

•I- 



CM 


to 


| tH 


CM 
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1 «M 


CM 
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+ 

| rH 
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to 


'1* 

1 -H 


CM 


to 


+ 

1 rH 


CM 


to 


-!♦ 

| t ( 


CM 


to 


+ 
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CM 


to 
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CM 


to 
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A- 


CM 


to 


1 »H 


CM 


to 


H* 

1 rH 


CM 


to 


H- 

1 rH 
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+ 

1 rH 
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+ 
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o 

to 
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o 

p< 



R 

to 
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p 

D 

to 

R 
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P, 

'0 

P 

CM 

II 
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O 

to 

R 

O 

Pm 

R 

to 



11 
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'H 

|5 

•d 

II 
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f5 

•R 
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P 

V 

3 

£ 

II 
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R 

P 
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II 
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R 
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•H 

to 
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■H 
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,G 
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*4H 
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VA 
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The most obvious of all language elements 



is the word. An unsophisticated speaker of a language can recognize some of 
its words, although be. .nay not be able to isolate the syntactic, morphological 
or phonetic elements. An uneducated language user can also recognize similari- 
ties between word- forms of different languages (e.g. G. Man n and E. rnan) . He 
does so not only by the type and number of similar sounds or letters (e.g. t i me 
will not: be identified with e mit , although both words have the same type and 
number of letters), lie must also recognize a similarity in the sequence, pattern 
or configuration of the units, their order and position (e.g. times and tem po 
will appeal: as more similar than time and emit ) . These similarities in configura- 
tion can be seen or heard by making a linear juxtaposition of one component in 
one language with its counterpart in the other. The question is one of sequence 
rather than of components. There are two types of linear correspondence - 
positional and equivalent. Positional correspondence is the one-to-one 
juxtaposition of elements in two interfacing units according to their positions. 

For example: 13. cat 

F. chat. 

Equivalent correspondence involves the interfacing of two uni ts according to 

the similarity or identity of their constituent elements. There is maximal 

equivalence when the interfacing elements arc so juxtaposed as to achieve the 

optimal number of identities without changing their sequential order. For 

example: E. c at 

p, ch a t 

This latter arran gmenent is the appropriate measure for comparing the content of 
units; the former is suited for the measurement of their structural difference 
(See below). Using one type of interfacing instead of the other is likely to 
make a difference in the results of the measurements. For example, 



English c otton’ and Spanish algo cion a being cognates, have a csrta 
fonnal similarity. Which of the two measures is likely to rove a 
Compare: 



Positional Correspondence 



Equi valent Corre s : 



E. 

S. 



cotton 
a 1 g o d o n 



E. c o t t 

S . a 1 g o d 

All pairs different. Three pairs ido; 

In other words, one calculation makes the words entirely differer 
the other gives their difference as being a little more than hal: 



O 
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Ln cegri? cf 
L this? 

? on den efr 

o n 
6 n 

itical 
it in foion; 
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In the cmc-s c>f two 3 ; n gungos with highly unphtnologj c ci 3 spell- 
ing r»> . . cIjU , such maximal vis in] torrc-spoiulenrc ii X’.y make son? word- feu ms appear 
more similar than the unsophisticated sp* ; !ccx of either language would 
estimate. For example: 

l\ h o in m e 
F;. n a n 



This 


is 


a voi do d if the into i f aci n g i s 


donu on the basis 


of c on s t j t uc 1 1 1 s y 1 1 ab 1 c 


un i 1 5 


• 










n. 


cot ton 


I: . m a 


n 




s. 


algo do n 


P. h o 


in in c 



Sy 1 1 ab i c a 1 ly t! i e Hn gl i sh moil osy 1 1 ab 1 e 1 3 me can be inter fa c e d w 3 f h t b o Sp a ni s h 
di -syllable tiej.po: 

t i m e 

V i c m p o 

One w ou 1 d not tber. identify the -e in tine v:jth the -c in Vicmpo, Once two 
form* have been thus juxtaposed ,or interfaced, it is possible to measure the 
formal distance between then. Pi stance in the written forms and the spoken forms 
can both be noasmed. Since the spoken forms, however, suppose the juxtaposition 
of equivalent sounds, which may differ in degree (c.g. the /t-/ in time and in 
fierpo are not pronounced in identical fashion) we begin with di fferences that 
can be seen ,and leave until later the more complex measurement, of differences 
that can be heard. 

By a linear juxtaposition of two graphic forms, therefor', v:c 
car, see whether or net two interfacing eleinents (letters) aixj the same oi 
different. If they are identical, the distance between the two elements is 
obviously nil and can consequently be counted as zero. If they arc different, 
there arc two possibilities - either there arc no interfacing letters, in which 
case Die distance can be counted as one; or the interfacing letters are differ- 
ent, in which case t he distance is counted as two. This can be reduced to a 



ERfC 
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i) Interface for n-:ixi i.u-i 1 syllabic equivalence. 

ij) Count combined total uni is (T) • 

iii) (heel and count identical interfacing units held 
in common (C) . 

iv) Subtract tlic.se (C) from the total (T) , giving a 
iron sure of the distance (1 j). 

v) Express m relation to the combined length (T) . 

So that, 

v*)' - T -r £ - 

Going bach new to our Inst example (E. t imo - S. t_ic^r;;]o) , the distance would 
be 4, calculated thus: 

t i m c T (total units) « 10 

or 

t_ i^o rri )) o C (common units}- 6 

0 + 1+0 *2+1 = 4 Diffcioncc = 4 



D 



(timo- ti e npo) 





4 

To 



These differences may also be reduced to percentages or jnade to range between 
zero and one, a procedure that would be needed in adding or comparing differences 
For example, we can see that the difference of 4 between time and t iemp o , being 
4 out of a combined total of 10 is .4 or 40*. For the next concept (man), the 
formal distance between the English form (man) and the French form {h ome) 
interfaced sy 1 labi cal )y , is 8 out of 8, that is 1, or 100 o. 

Tins reduction permits us to coiribirtc formal distance with semantic 
distance, likewise expressed in percentages ,by showing the one below the other. 

In the above example, the semantic distance D for ( homme- man) was ,62. So 

that, 

(homrnc^man) ,62 

D -= 

(homing man) 1 



O 
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An oven i.\>re accurate measure can he obtained by weighting each interfacing 
similarity according to its position in the linear sequence, since it may be 
argued that it is linearly that languages arc perceived. But since this involves 
more c c> mp 1 c x c a 1 cu 1 a t i on s , we si s all 1 o a ve it until laic v (5_. 2 b e I ow ] t 

Lot us now extend our measurement by considering the difference 
between how the word s h n ve to b r* p ro noun cc d , since di f i e ren cc s in 1 e xi c a 1 f o rn 
can be measured either as to how the words look or as to how they sound. The 
latter will depend on our knowledge of code difference in phonetic form. 

3 . 2 . 2 . 3 Phon etic To n i 

In measuring the distance between two codes, we have 
made a distinction between formal and semantic measures, the latter having 
possible reference to the physical world and expressing character! sties which 
are not United to language. In pushing our formal measures further and fur- 
ther down the linguistic hierarchy, we also end up in the physical world. Fo:; 
sounds, like meanings, are not limited to language. 

The ways in which meanings or concept categories are 
designated in sequences of language sounds are limited by a number of the hum in 
vocal tract. Because these sounds 3re produced on a time sequence, there must 
be limits between one sound and another. Because of the way the human breath- 
ing apparatus is constructed, there must also be units related to breath. But 
how far can we postulate the generality of other phonetic features? Ke might 
be safe in postulating the universality of the syllable, the consonant and 
voiced sound, without going so far as to claim that all languages have stops, 
fricatives, labials or dentals.. 

It would perhaps be safer to speak of biological 
uni vers als and say that all normal human beings have a mouth, a tongue and a 



throat. And since all linguae./' sound 2 ; rust be produced within the limited 
capacities of the e human orpins adapted to the product? c. s of sounds, their 
uni .vers al physical potential (mouth closure, tongue position, etc*) ray be 
used as a means of Measuring the difference between languages at the base levtl 
in the h i c r archy of 1 an f;ii a ge u; m t s . 

On tliesc universal physical feritu:vs of the htusn body, 
it should be possible to establish valid tonsures of difference in the? produc- 
tion of speech sounds, since we can assume that the essentials of the vocal 
organs are the saw? for all iron in all places, cr that everyone at birth lias 
the p otential for learning any human language. by r, assuring the relative 
position of these organs in the production of e £ eh sound of a language, we 
can determine the extent to which It is different from any sound in any other 
language, Khat ar 2 these organs? They arc those which change the- i r posit ion 
or si) ape to produce significant differences in speech • the tongue, the lips, 
the jaw, the muscles of respiration and the vocal cords . these organs together 
produce two types of physically Measurable units - one acoustic an d the oilier 
physiological. The first is the appropriate measure for the prosodic distance? 
of syllables; the second, for the allophcnie distance of speech sounds. 

a) Prosodic D is tanc e 

If we listen to a language we do not understand, 
we are first struck by the rise and fall of the voice and the beats of sound 
varying in length and loudness. Some people learn to recognize languages they 
do not understand simply by listening to these featur s of rhythm and intona- 
tion. 



One can hear t lie difference between the syllables of 
languages, since they vary in the level and direction of the tone of the voice 
and in the relative length and force of their constituent sounds. This is 
easily observed in the pronunciation of horophonic place names and loan words. 
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The nitre Canada sounds di ffonnic: in English than it c-cvs in J ? rcnc)i or Hungarian , 
not 50 midi beta isc the sowuls ore pronoun cod with a _d.:i flu <15 .f Terence , as 
because of the force and 3ength of the syllables. Compare: 



English C a 

French C a 

Hungarian C a 



n a 
n a 
n a 



d a - loud/ weak/ weak 

da - medium/ medium/ loud 
d a - loud/ me d i uni/ n >2 d i u r : 



It is through such homonyms that we can begin lo measure the syllabic £ prosodic) 
distance bei* we on pi i on t t i c codes. 

There arc at least t.vo ways that syllabic differences 
can be measured - instrument ally and taronoir.Lcally, A single Inst runout , such 
as the cathode ray oscillograph or the acoustic spcctogrr.ph can re vea 1 differ- 
ences in syllabic length and tone, measurable within a difference of a sijigle 
cycle of vibration per second. The disadvantage of such instrumental neasiu'c- 
iidit is that it can itsc? samples from only a limited niuibcr of persons, so that 
minute individual speech and voice differences t?nd to obscure the differences 
between the language codes* Instrumental measurement can, liwevcv, be a hc’p 
in establishing usable scales and categories of relative difference* 

If we examine the relative differences only, we 
can place then in a number of categories by using a series of such distinctive 
features as high and low, strong and weak, fixed and moving, thus establishing 
a nuniber of distinct categories within each language for force, length, height 
and direction of the syllabic tone. For each of these four syllabic variables, 
we can establish a scale, giving a numerical value to each point {c.g* short = 0, 
medium = 1 and strong = 2)* By using these values as vectors, we can subtract 
them one from the other in interfacing syllables, to measure their prosodic 
distance in natural n umbel's (S ee Tabl e 6). 
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In ri' nsur'ui g prosodic di st i .ce , therefore, ;:o t rht 
four syllabic variants .into account : fore-; , lciijth, direct! on and hoi gbt of 

voice. (Catenation is accounted for in phonetic rersurertont . ) 

JPorco : A syllable in Ihe stream of speech my Appear 

louder or quieter than that of its nei &h hours . Tills is 
measured by giving n value of 0 to the weakest (so as not 
to make variations of force dependent on length, the adding 
zeros gives zero). So that it i .. entire iy possible for a 
syllable to be longer and at the sai.c tii.o quieter than 
another, for the loudest syllable we attribute the value 
of (2). 



o 

ERIC 



length : The individual length of the syllable is measured 
by giving the value of (1) to the shoitost in the length 
repertory of the language and (3) to the longest, while 
giving the value of (2) to the others. 

Direction: Comparing two syllables, it nay be noticed 

that the tone cm remain level throughout the length of 
the syllable, or that it changes in one, two or more 
direct ions during the period of tine tal.cn to utter the 
syllable. If the tone remains level, wc give the value 
of (1) to the syllable; if it changes, we give values 
according to the direction and amount of changing. 

Heigh t : Listening to two different syllables » ria >' 

reveal that the one is higher in tone or lower than the 
other. One can measure this by coreparing the tones 
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syllable by syllable to sro the extent to vrhich 
they correspond. The value of (0) is given to the 
lovest tone and (5) to the highest (See T^blc C ) . 



For example; 



tfcaau recent of prosodic, differences in the English and French prenmeiai i on of 
jCanndn, gives the following results: 



RHYTHM 

Force 



E. 

r. 



Vectorial Distance 



CA NA DA 



2 0 0 

1 1 2 

1 + 1 + 2 



4 



Length 

E. 

F* 



2 0 0 

1 1 3 



Vectorial Distance 



+ 1=3 



INTONATION 

Di recti on 



E. 



F. 

Vectorial Distance 



1 0 0 

0 0 0 

1 + 0 + 0 = 1 



Height 

E. 

F. 

Vectorial Distance 

O 

ERIC Total Vectorial Distance 



3 1 1 

2 2 1 

1 + 1 + 0 



_2 

10 
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TMWJ: 6 



PROS 001 C VALUES ATM) NOTATION’ 
(S = sy 3 J able) 

















(marked nlmve 














the ii) 














l : o rce 


v;cak 


modi un 


s t r on g 








symbol : 


(tninarhc d) 


i 

S 


IT 

s 








value : 


0 


1 


2 








Ion gtli 


short 


medium 


long 








symbol : 


S 


S 


S 








vn 3 u? : 


0 


3 


2 






















(marked before 














the sy 13 able) 














Direction 


static 


falling 


rising 


rise- fa 31 fall- rise 


3 changes 


symbol : 


! S 








s/ S 


/V.; 


value : 


0 


1 


2 


3 


4 


i; 


Height 


extra lav 


low 


mid- lc‘y 


; mid-high 


hi gh 


extra high 


* symbol : 


i s 


i s 


i S 


■s 


■s 


's 


value : 


0 


1 1 


2 


3 


4 


5 


^Direction syi 


+ 

lib o 1 s ( | \ 


/ a v/v/) are 


written 


below the line 


CO) , 




through the line . (3) 

~i — 


, on the line 


CM 

w 


, under the line' 


—(3), 


through 



top line ^(4), on top of top line (5), 




* 
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b) All o]ihon i \) } r, t i-n ce 

As one approaches the physical basis of human speech, 
our pnssibili tics of accurate measurement increase. If the above typo of 
vectorial measurement is applied to different speech sounds, these physical 
variables can be given scale values corresponding lo differences that can 
be seen as well as heard. They can lie seen directly or indirectly with the 
aid of such photographic techniques as paint og> nphy and ci n e radio log y 

Since any sound can be described by the action of the 
organs which produce it, the extent of such' action can be given a value on a 
scale. With six scales corresponding to the action of 1 lie speech organs, 
all relevant speech sounds can bo described ( See figure 1 ) „ 

bach sound in each language can therefore be neasured 
according to ] (its point of articulation), IT ( the degree to which the 
organ affected is closed or constricted ), 111 (timing of action of the two or 
more organs involved in production of the sound), IV (shaping of other organs 
involved in the production of the sound), V ( the direction of the flow of air 
through the mouth or nose by the action of the velum) and VI (the action of the 
vocal cords) . To cadi of these variables scale values can bo given ( See 
Table 7 ), 

To measure the distance in the phonetic code of two 
languages, juxtapose all their phonemes and, using the scale, ireasurc the 
degree of difference in their distinctive features, for example: 

English /a; / is: I (fronted = 4), II (mid- open to low 6.5), 

4] 6 . 5 1 3 1 0 1 0 J l| III (continuous = 3), IV(non- const rl ctive = 0), 

V(oral = 0) f and VI (voiced = 1). 
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TABU: 7 



VEcroiuAi. VAUJiis op I'lioxnf-s: :o:-po”);xts 



Values 


0 


I 


2 


3 


A 


5 


6 


7 


Scales 

I 


- 


bi 1 abial 


] abiockmta 1 


i n tcrdontal 


apical / 
front 


coni ral 


back 


glottal 


II 


- 


i 

closure 


1 "* 
fri ct i on 


ser.iivoKo 1 


hi gh 


ri en- 
closed 


ini d- 
open 


lev 


III 


- 


no con- 
st ri. cti oa 


moment ary 


cent i n uous 


1 on gt !i - 
cried 








IV 


no co- arti- 
culation 


VO c 1 i c 
(off- g lido) 


a ff rical ion 


! 

| 

1 










V 


cl oscd 
velum 


open ~ 
velum 




i 










VI 


- 


vibratlrT^ 
vocal 
co uts 


vocal cords 
open 


j 

i 

_ __ j 
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Figure 1 



s oiiiMAVJ.c vniv: of phoni-’v; dks 





>o 
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French /a/ is: 



1 (fronted - A), II (low = 7), III (continuous = 3j 
IV(n on- const ricti ve " 0) , Vforal -0), VI (voiced - I). 



A 7 8 P 



0 1 



Juxtaposing the comparable values and subtracting in natural murk era gives 
the vectorial distances between the two soiuds, thus: 

r ii in iv v vi 

E. / ao / : 4 6.5 3 0 0 1 

F. / a / : 4_ 7 3 0 0 ± 

D /re - a/ =• 0 + . 5 + 0 + 0+0 + 0 - .5 

The vectorial distance between the two sounds is therefore .5 . 



When all the sound features of one language lave thus been juxtaposed v.ith 
those of another, the result is a reference table to vectorial distances in 
the jhonctic codes, by way of example, we have elaborated such a table to 
measure the vectorial distances between the sounds of English and those of 
French ( geo Yrhlc 8 j f 
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Since there is rarely a one-to-one correspondence between 
the phonemes of different 1 anguages , a phoneme in Language A nay have to be 
compared with two or more phonemes of Languor,? X. For example, the Spanish 
[fi] may bo compared with the English /b / , /v/, and /v;/ . In comparing two 
grammatical or lexical forms the individual phonemes nay differ according 
to position or adjacent sound, as do the two /d/s in the Spanish word /dedo/. 

Neither graphic nor orthographic representation of the formal 
elements of two languages gives a complete picture of the degree of similarity 
between two forms. A phonetic to phoneme comparison of interfaced equivalents, 
however* measuring differences in distinctive features reveals similarities in 
form that may seem quite different to the eye. Whether the letter £, for 
example, is compared to ); , q, or or any other letter, it is always indicated 
as a difference. The similarities of words written in different orthographic 
systems can he reduced by transcribing the words into graphic representations 
of the phonemes so that differences in spelling do not hide similarities in 
word-form. For example, compare the following: 

E. cat’s E, / k a t s / 

G. Katzen G./katscn/ 



E. quarter E./kwortor/ 

S. cuarto S. / kw.arto/ 

The choice of transcription will depend on whether it is the distance in the forms 
of the language as forms, that is measured, or whether such communication 
variables as homographs and homophones arc accounted for. If distance between 
spoken forms is measured, a transcription of the phonemes may not be sufficient, 
since it does not reveal similarities in such audible features as voicing, 
point and manner of articulation* so that the siiiilarities in such pairs as 
An.b/, / 1 , d/ , and A>g/ are not taken into account. In order to do so, 
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comparisons br^ed on distinctive font nic differences c bo used. To 
illustrate the lb roe possibilities, take one of the above examples of the 
Hn glish /Sp an i sh c o gn at cs cot t on / rd TP < ■ on : 



01 h o gr u] ih i c JT ;■ :oj ogi c a 1 A) 1 oplion i c 

cotton k o t o u k o t o n 

algo don algo don algo don 

1+1+2+0+1 +2+0+0 1 + 1 +2 +0 + 2+2*; 0 10+10 + 1 +0 + H-1.5 + 1.5 

D- 7/13 = -54 D * 8/12 « .67 1) - 24/120 « .20 



After having taken the syllable distance and accounting for the difference in 
stress, use of distinctive fanturcj of allophonic comparisons make the two 
words sound much closer them they would if measured from their orthographic or 
phonological forms. Note that on the same scale of a maximum distinctive 
difference of 20 between any two sounds of bullish and Spanish, the distance 
between the /k / of English and the /g/ of Spanish is twenty times less in 
distinctive features than it is in simple graphic representation, where the 
phonemes are regarded as being entirely different. 

Such vectorial tables of distinctive feature differences c;m 
be used not only in measuring the difference between the formal elements of 
the codes, but also in calculating the distances between equivalent stretches 
of discourse. 

Differences or identities in codes do not, as we have seen, 
necessarily produce corresponding differences in their equivalent messages. 

The full extent of differences between languages can be seen by measuring how 
they differ in performance , that is, by observing the codes in action. In 
other words, measures for distance in discourse must be obtained. 
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3.3 Lh ;i \ *u. i e -s of J nt crcudal Meosuiejiint 

Differences in codes ore no guarantee of corresponding differences in 
the messages they produce. 

The lemons are that the distribution of similarities nnd differences 
produced b/ even the most exhaustive juxtaposition of two codes will not 
nccessaiily be identical to those in two equivalent texts. Examine the follow- 
ing and compare the siullariti.es within the first set (Set I) of equivalent 
sentences as against those within the second set (Set II): 



I 

lh Here cones the old man who js 
blind; give him your hand. 



G, Hie? 1 k or. iint dor alto Mann dor 
blind ist ; gib ih‘:i die Hand. 

lh Void 3c vieillard qui cst 
avouch;; duunc-lui ta main. 



II 

13 . A permanent committee of the 
government is responsible for 
changes in the Constitution. 

G . hi n s t hn d i go r Pc gi e run gs a usscluus 
irt A\r die An do run gen dos 
St aatsgruTidgosctzcs vcrantwortJ Lch. 

P. IV. coral to permanent du gouvemo- 
ncnt cst res pons able dos chan go- 
me nts dims 3a Constitution. 



It is obvious at a glance that in one case (I), the English text is much 
closer to the German than it is to the French, whereas in the second case (II), 
the converse is true, the English text being closer to the French. 

Two languages may havt a n >ober of such similarities at one level and 
not at another. There nay be similarities in the vocabulary but not in tlie 
grammar; similarities in the grammar but not in the vocabulary; similarities 
in certain features of pronunciation but noc in others. For example, in most 
respects, French aid Hungarian are quite different; but in certain features 
of pronunciation - phonemes (rounded front vowels) and pros ode mcs (syllabic 
equality) they have much in common. Likewise, English and Ciinese arc described 
as being highly different languages, yet in some respects English may be more 
similar to Giinese than it is to German. According to soundings made by 
O 




76 



74 . 



Jcspersen , i t would seen that the regularity of werd- order (SVO) in Modern 
Fngl.ish prose would range roughly botv;een a host 90% and 97%, os compared 
u 5 111 Chinese, which is close to 100c, Contrast this to German where the 
l'e gu 1 a ri t y j s b a re 1 y aben f ? 50 % . 

For practical purposes, the similarity of two languages will be seen 
in the messages or texts which the languages produce. Since students of 
language learning and bilingualism are laced wi di specific cases, it is 
important to discover techniques for dealing with them. One way is to 
analyse, and measure not only the differences between the cedes, but also the 
distances between the messages or samples of discourse winch the codes gener- 
ate within comparable contexts. 

In order to be comparable , two texts must be equivalent, A greeting is 
comparable to a greeting and not with something else. Any two s tret dies of 
speech which convey the same massage - a sequence of sentences, a phrase, a 
word, or a morpheme - can bo juxtaposwd to an equivalent in the other language. 
One can even imagine two languages as two equivalent, endless and interfacing 
texts . 

The operations necessary for bringing about this equivalence are f met ions 
of the degree of difference - the distance - between the two languages, and 
can there fore be used as a measure of this distance. These operations range 
from the simple use of homonyms (homophones and homographs) , which are the 
same in both languages, to complex changes in structure and level. 

It is therefore possible to establish a scale which would reflect this 
di f ference. If we postulate equivalence in meaning we measure differences in 
form and function. We can start either from the orthographic form or from 
phonetically transcribed sample? of speech. For purposes of consistency 
and simplicity, we shall start with the orthographic texts. 
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j ; equivalence o j two such interfacing texts poses two prch lev.? , 

Within each language, there is a difference between what must be said and 
what Tiiay be said. In some languages there nay be only one way of expressing 
a gi ven no s s a go ; In ot lie rs , til o re m ay b c many v; ays of i:t. t c ri n g t h c equivalent 
message. In one language there may be more freedom than in the other. And 
tli is difference in degree of freedom is itself n measure of the distance 
between languages. This degree of freedom is greatly restricted once the 
stj'cUh of discourse hits to function In a context of situation (See 5.2 above ). 

To be able to compare any two equivalent s tret dies of speech, however, 
and measure the difference between them, one must fim knew the features in 
which one text can di ffer from another. 

In measuring the difference between two languages in semantically equival- 
ent texts, one can either begin by analysing and classifying their difference?, 
giving values to each type, that Is, by taxonomic me a moment , or one can 
arrive at a single measure of the texts as they appear face to face, that is, 
by integral . me a? urement . 

In the first case, taxonomi c measurement , one may have to go beyond what 
niects the eye and relate the text to the code which produced it; in the 
second case, one measures only what is overtly expressed and is directly ob- 
servable, Let us begin by analysing the fii'st case, the taxonomic distance 
in discourse. 
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4 



IV XON'OMIC DISTANCES fN DISCOURSE 



4.1 Di I - Ci i s i ons o V D L f f c ren ce 

4.1.1 A Typology of Diroensi onal Differences 

4.1.2 An A1 geb ra of Di jnc n s i oi i a 1 1)1 f f e ren ces 

4. 1.2.1 Notation of Hierarchy 
4.]. 2. 2 Notation of Order 
4 . 1 . 2 . 3 Not at i on of 1 ength 

4. 1.2.4 Notation of Depth 

4.1.2.5 Notation of HI events 

4 . 1 . 2 . 6 Not at i on of Del at i on , 

4.1. 2.7 Notation of Set 



4.1.3 Differential Analysis 

4. 1.3.1 Nunber of Dirensions j 

4. 1.3.2 Components of Differences / 



4.2 Degrees of Difference / 

4.2.1 Conversion of Hierarchy ? 

4.2.2 Conversion of Order f 

4.2.3 Conversion of Length 

4.2.4 Conversion of Depth 

4.2.5 Conversion of Elements 

4.2.6 Conversion of Delation 

4.2.7 Conversion of Set 

4.2.8 Multidimensional Conversion Distance 
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4. TAXON GMT C DISTANCES IN DISCOURSE 

Equivalent stretches of speech can differ in several ways, since all 
languages have various roans of expression, such as intonation, vocabulary, 
and word-order. Within each of these areas or dimensions there are great 
differences between languages ■■ both in type and in degree . In taxonomic 
racasurement , differences of type within tV’se dimensions of interlingual dif- 
ference must precede the measurement of the degrees of difference. 

In order to find measures for different types of di ff „ renee, it is first 
necessary to determine what these types are^ One must begin by ashing a basic 
question: How can two languages differ in the texts they produce? The diron- 

sions of difference must be determined before the degree of difference. 

4.1 Pi mens i ons of Pi f Ter ence 

To examine the dimensions of i n t o rli ng.ua 1 difference in discourse, 
the different types will first be isolated and an efficient way developed of 
referring to them. In other words, before one can engage in any sort of 
taxonomic Measurement , both a typology and an algebra cf language difference* 
are required. 

4. 1. 1 A T ypology of Pi non s ion a 1 Dif ferences 

Hew can languages differ in form? They should be able to 
differ in the dimensions common to all languages. V.hat are these dimen- 
sions? It seems evident, as we have seen, that all languages, by their very 
nature must have some sort of elements (sounds, woi'ds, sentences) some of which 
are more inclusive than others, thu' farming a hierarchy of inclusion. These 
elements must appear in some order in speech, one before the other. These 
sequences of elements must necessarily hove length, some being longer than 
others. The elements, belonging as they do to classes, must also have depth, 
some permitting more substitutions than others. These elements ore conposcd 
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of units which are visually or acoustically perceptible, as being identical 
or equivalent- Some of these elements arc bound to have relations with other 
elements. And finally, all elements belong to sets, the members of which have 
similar functions. 

Two equivalent language texts nay differ in any or all of 
these dimensions. They may differ in the number of echelons in their hier- 
archies, as might be the case between a highly agglutinative language like 
Eskimo md a highly analytical one like English. They may differ in the re- 
quired ind optional order of their elements, functioning as units within a 
structure, as in the case of word order in French, German, English and Latin. 

They may differ in their length, as measured b> the number of elements in the 
corresponding units, as would be evident from comparison of Hungarian and 
Chinese words. Some elements, however, may not even have an equivalent: in the 
other language; others jiiay be left unstated in discourse, so that their length 
is zero. Others may have an equivalent with sc nothing added, as when a veil) 
in one language is equivalent to a verb and a jrhrase in the other. In some 
cases, ;his counterpart , belonging to a different rank in the hierarchy - as 
when a pronoun in one language has to be rendeied by a group of words in the 
other language - entails by that very fact a difference in length. Equivalent 
texts m;r also differ in their depth as measured by the number of alternative 
units winch can replace each unit in the utterance; the pronominal units of 
Japaneses or Bengali give more alternatives than do Dutch or English. D?pth 
is a ine t 'isure of what may be considered as equivalent. Most obviously, languages 
suffer in the configurations of the elements themselves - the shape of their 
morphemes (woi'ds and word-endings); yet language contact and genetic relationship 
are the causes of many identities and similarities which tend to reduce this 
sort of distance between languages - visual identities (h wogi phs 1 \ko net U n 
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in English and Trench) and auditory s.liu? J lari ties (homophones like /bans/ in 
English and German). Equivalent: texts of. two languages may also differ in 
the niuribcr of specified isolations between the units, some, languages, like German 
inquiring changes such as the form of the ai-ticj.e and the adjective to agree 
with that of the noun; other languages, like English requiring no such formal 
relations. Finally the two languages may differ in how they organize their 
elements into such distinctive sets as form- classes , as when something expressed 
by a vc ib in French mast appear as an adjective in English. The formal organza 
lion of different languages, therefore, may differ in any or all of seven 
dimensions - hierarchy (ii) , order (0), length (L), depth (D) , element equival- 
ence (E) , relationships (U) , and set membership (S) . 

Hie first taxonomic operation is establishing which part or 
parts of two semantically equivalent stretches of discourse are equivalent in 
meaning, ’iliis can be done by interfacing the texts and segmenting them into 
equivalent units. For example: 



F, 1 1 sc lave 
E. He washes 



le matin./ 
in the morning. 



When two equivalent texts have been so segmented so that tne equivalent units 
are comparable, each pair of such units can be examined for the above types of 
di ffercnccs. 

Equivalent units in some pairs of languages will be found to 
differ in only one dimension; in some pairs they may differ in two, others 
in three, in four, or in all seven dimensions. For example, the English and 
Geunan personal pronouns, although they have the same nur.ber (depth), differ 
in their required relationship, whereas in English and French they differ in 
both depth and relation; in one case we have a one dimensional difference, 
in the other, a two dimensional one. For each feature of any pair of 



languages being thus ccmpaied or contrasted, there ere many possibilities, 
ranging from a single difference in hierarchy (il) to seven simultaneous dif- 
ferences (HOLDERS) (See Table 9 ), 



Below are examples of differences in one, two, tlncc, four, 
five, six, and all of the seven dimensions and also examples of a number of 
selected combinations of these components. So as to be able to identify 
behind the particular words and word grou^ , the type of arrangements and 
relationships involved, some way of noting essential differences has to be 
worked ont. For differences within each of the dimensions are not all of the 
same type or extent. In oi'der to deal with them in specific cases, some sort 
of simple but adequate notation has to be developed - indeed a sort of algebra, 
in the sense of a generalized systematic use 06 symbols to express and analyse 
the relationship between concepts. 

4.1.2 An Algebra o f Dimensi onal P i f T erences 

In devising a simple notation for the analysis of language 
differences in each of the above dimensions, it is important to keep in mind 
the existence of synbols already in use in both linguistics and mathematics, 
while attempting to solve conflicts with the maximum of simplicity and elegance. 
Tins can be done with the help of a simple mod;l of two different but equivalent 
t e xt s ( Sec Figure 2) „ 

If the two languages are symbolized as Language A and Language X, 
the units of *hcir texts as ABC arid XYZ, respect i vely , the constituent elements 
of each of these units can be /a be/ and /xyz/, so that Unit A is equivalent (O) 
to Unit X, B to Y, and C to Z ( See Table 10 ), 

With this basic notation we arc able to express differences in 
hierarchy, order, length, depth, element equivalents or identity, relationship, 
and set membership of two equivalent texts con be expressed with supplementary 

notation of variables peculiar to certain dimensions. 

O 
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i 
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DIMENSION'S OF LANGUAGE DIFFERENCE 




If . ) 



C) 
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Types of differences in: Hierarchy (H) , Order (0), Length (L} , Depth (D) , Elements (E) , Relation (R) , 



Figure 2 

DIFFERENTIAL MODEL 



^r:7 

A jW 
! i»>\/ 



/z:*7\ 
j 13 >>• 

...jy 



< 



1 ' 


~z 

/ 


1 c 


c" 


1...J3 


7 



zrz 

x 



i 



._JZ 



Zx 7 



y 



._lyz 



Algebraic node! of equivalent texts in Languages A and X, showing 
hierarchy (Aa) , order (ARC), length (1-2-3), depth (a n ) , elements (a,b,c), 
relations < CA,6>and set membership ^A,X^ 

Equi valence (O) : 

A O* X , B -O Y , C =0= Z. 
Congruence ) : 

A ££ X , B as Y , C & Z. 

Identit y ( = ) : 

C = Z 

Differe nce ( ~ ) : 

(ABC) ~ (XYZ) = Ay? X) + B ~ Y) 



Example : 



A 


b 


c 


une 

i i 


grande 

f 


tab le 

j 


I 

a 


1 

big 


i 

t ab 1 e ! 


x ! 


y 


1 j 




t 
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TABUS 10 

SIGNS A\ T D SYMBOLS FOR TAXONOMIC M;ASURIiM:NT 



RANKS 

A 0 sentence 
A: clause 

A; phrase 
A y group 

A. word 
•A morpheme 
pro socle n.o 
A7 phoneme 
A^ allophone 



UNITS 


CATEGORIES 




GROUPINGS 


A 


X 


N nominal n 


pronoun 


(a, a) constituents 


B 


Y 


V verbal v 


aux. verb 




C 


Z 


D adjectival d 


determiner 


< a ,b>* re 1 at i on s 






A adverbial a 


str. adverb 




ELEMENTS 
a x 


c conjunction p 


preposition 


|a,b| sets 


b 


y 


RELATIONS 




c 


z 


Equivalence =C= 




non-sets 






Congruence — 










Identity 










Equality — 










Difference 
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If hierarchy is further symbolized by quasi- conven- 
tional sub-scripts, a limited number of markings is capable of describing any 
equivalence or difference in rank, from sentence to allophone. These are: 
sentence (A c ) , clause (A:), phrase (A;), word group (A,), word (A.), morpheme 
(-A) , prosodeme (A), phoneme C/A/), and allophone (Ca! ). 

For example, in the following equivalent texts, one 
segment, a French phrase (A;), n 1 import s qni , is equivalent in meaning to an 
English falling- rising ( v/ ) intonation pattern (X); 



F« 11s n'invitent pas n T ir.porte qivi / 
E. They don't invito ^'anybody. 



A, 



X 



4, 1,2.2 Not at ion of Order 

If A is equivalent in meaning to X, B to Y, and C to Z, 
identity or difference in order can be indicated simply by the sequence of the 
letters. For example, tho French' word repon d" (A.) is equivalent to the English 
word answers (X.) and F. tou jours (B.) to E. a lways (Y.) in the following 
sentences : 



F. II r^pond to it jour s / 
E. He always answers 



A.B . 



Y.X. 



4 . 1 , 2 . 3 Notation of Leng th 

Length is indicated simply by the nurber of units. 

To avoid confusion with differences in hierarchy, however, it is useful to 
stay within the boundaries of the l'ank being n ;ed, since the higher the rank, 
che longer the sequence of units is likely to be. But not necessarily: Thanks . 
can be just as much a sentence as I thank you very much . In the following 



example, a French verb comprising of a single word is expressed in English in 
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F. Chorcher / 



A. 



1*, look for. 



*0 X. Y. 



Length can also be indicated by numbering each coinponent. (See below . ) 



4, 1.2,4 Nota tion o f Depth 

Depth can be considered as the lumber of alternative 
equivalents which the code permits - not only the number of different vocabu- 
lary equivalents, but also the number of phrase, clause, and sentence equival- 
ents; in sun, the miuber of different ways of establishing an equivalent text, 

In the example below, the French word jugur can be rci idered in two ways in 
En g 1 i s h , by glow 0 r g l e a m : 

F. Lucur / X. 

E * G 1 ow Y . 

Gleam 

fhpth may also be indicated by noting the number of alternatives as an index 
in superscript. This is especially useful in measuring degrees of differences, 

A, = X 2 . 



4 , 1,2,5 Notation of Elements 

In noting the similarities and differences of the 
constituent elements of two equivalent texts, a lower-case letter can be used 
for the unit or segment being analysed - (a) for A, (x) for X. For example, 
in the French ”ord group (A,) pauvres parents , the first word pauvres can be 
noted as che first constituent (a^*) and the second word, parents as the second 

constituent (a^) » numbering from left to right. Comparison is also done from 
left to right, so that i f on the right side of the equation an element appears 

which is identical to its equivalent on the left side it can be so indicated, 
as is the orthographic identity of the word paren ts in both texts: 



X.B , 



or 



0 
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F. pauvres patents / 
E. poor parents 
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A(aja 2 )=c= Xfxjap 
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4 . i . 2 . 6 Notat ion of P c J aticns 

Any relationship that needs indicating can be expressed 
between the conventional relationship brackets < . But only significant 

differences need be marked. For example: 

F. grande maison / 



A. Br>. =C 



X.Y. 



Ii. big house. 

Many types of concord and government can thus be stated with varying degrees 
of refinement. 



4 . 1 . 2 . 7 Not ation of Set 

If two units or elements are not members of the same 
or equivalent set (form- cl asses , parts of speech, grammatical category, phrase 
type, etc.) this can be shown by the conventional set-builder brace ^ ^ . 

To indicate that they belong to different sets, the type of category rav be 
indexed, {e.g.N for nominal, V for verbal) |A e ^=o= [x^ 

or invert the braces : for example, F. At tent ion /£. Look out. 

W *=-j sv { 



4.1.3 Different ini An a lysis 

An equivalent pair may differ in only one dimension, cwo 
dimensions, three and more dimensions. The conplexity of the conversion differ- 
ence may be reflected in the complexity of the formula. 

The above notation is sufficient to indicate differences in 
any combination of dimensions; as the following examples indicated, this can 
be done both for the description of the number of differing dimensions and for the 
inclusion of various dimensional components. 

4.1.3. 1 Nunber of dimensions 



1-D Differences 



O 

ERIC A 

H fA/X): 



I*. Oui , effcctivcncnt jc suis adulte./ 
E. Yes, T ^am an adult.' 
cffcctivemcrt / ^ 
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A. X 



X 



,2~D Differences 



xf:<) . 

Yfy). 



3- D Differe nces 

A (a)^. G. Bitte (gesturing) / I;. Have a seat 

B (b), Danko, / Thanks 

C (c) w Bittc / (nil) 

HEL (A/X): A(a: v B(b) f C (c\ f «*= X (XjX^ Y(y)„ 

4- D Di fforenrcs 

F. Son frerc a le ne z long./ 

E* His brother has a long, protruding nose. 



A bk 


le 


/ 


a X Ra, 


B<b a > 


nez 


/ 


nose Y (y) 


C (c). 


long 


/ 


long, Z fc) 

protruding, Z (z) 


> 

X 

> 


>.j lt B<b«s> CCc) «©= X £ 


Z ">- 
^ Z M . 


Y (y) 


or; 


Ja.| b*>. c 


H«S: 


Y. 



X ( X |^2 X ^' 
Y C>% 



F. Ce fut le plus grand qu o 1 1 on ait ja mais vu / 
E. It was the largest ever seen. 

A ( ( a i a 2 a 3 a 4 a s) : f l ue l f ° a t] * * jamais / ever 

B(b). vu / seen 

HE (A/X): A ;a 1 a 2 a 3 a 4 a 5 ): B(bl X(x) . Y(y), 
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5-D inferences 



I\ I o ch i cn ctei t en train d * on 1c ye r a vc c 1 a 1 nn gne 
du ch ocolat g nL sc trouvait sur sa figure 

E. The dog was licking chocolate off his face. 



A; 


otait en train 


dc / 


was , , 


. .-Ing 


X 


(*•->') 


BfbOtojb: 


< d T cn lever cvcc 

qui se trouvalt 


la Icing ue / 
sur / 


licic . 


.. off 


Y 




C, 


du chocolat 


/ 


chocolate 


Z 


• 


MOLES (A>X) : 


A; B(h;}c, |b tl 


^ X (x. -y) 


Y(y.)z. 


H 







6-D I )i ffe lan ces 

1 ; . On d i t quo 3 c r oi Ab du 1_ est Ion;,. rrent p arti . / 
E . King Abdul has nl ledge :1 3y lef t fc n g ag o . 



A: On dit quo / 

B , lc roi Abdul / 

C (cj) ] c (<^ 3 bj>- est longoment/ 
* parti / 



or 

C( c i). ?c 2 z 2 ^. 1 'c 3 ,c ] bj>. 



al lcdgcdly 
King Abdul 

left long ngo 



X. 



Y(y bj ) , 

Z(',) $2,) ; 



or 

C C! C 2 c 3 <t l b^. 

HOLERS (A/X): A:B (bjbjb-) .Ccj^ c 2 c 0 < Cl bp- 




X. Y (y b 5 ) , Z Zj?z 2 c 2 $, 



7-D Differences 



A( a i a 2 a 3l 




F. 1 /? profosscur Martin aurait e tc' conge die' sa ns cere: :onic . 

E. It scens that Professor Martin has teen summarily fired 
(let off). 

Lc professc in* Martin / Professor Martin X (x a 5 ) 

-ai t /it seens that ' Y ; 
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C(ci-b)^ C 2 i ^ nuv- cl v con ;,e d i c / 1ms been summarily Z 

^ sans cerev;<*nie . fnvd (jot off) 



G-H/) 

IIOIWA'S (A/X): A (.ijO^) C (c r W K C. 2 <<^l) jc 3 ,j O Y; X zfz, ^ z 2 ) 



4 . 1. 5. 2 Compon ents of LM f for on ees 

In each of these groins (1-7 dinonsiens) which have 
jusl been exempli fied, the actual dimensional components nay be quite different. 
Tor example, a u,,; -dimensional difference may be of hie :*arehy and order (HO), 
length and relations (LR) or any of the other 1? combinations of tw u- di ronsi.cn a 1 
components ( See Table 9.) Here are a fev: examples of such combinations: 

F. (s) x: J. (ss) 



L 


A =o- A 11 


F. Mayonnaise/ J. Mayen 


;CIU 


0 


A.R.^ R.A. 


F. credit social / };. 


social 


SE 


KrH, 


F. Love 2 la main./ li. 


Li ft m 


he 


A * z&z V 
A o 


F. Ifa vrai sauvnge/ E, 


He's iL 



sun <a,j k n.c ; 
u;o a.r.a.-^s: x.y. 

Ui? A.IjfC- x. 



•\ 



DES 



ft- H. 



M» Y ’ Z \ 

^ 1 Jjpup b os t a l a ?ilu o* / }■_« It I coke i|Vf> rain 9 
F. II sc mod chcz-Iui ».-n voiture,/ H, He drives home. 



F. II faut 1c frappor a orod" cou ps. / L* , You've got 

t o thump i t . 

F. II se rendit fier jOrguel leux. / F.He surrendered 

proudly. 



EL I A,li. =c-- X,V.2<):.z 1- : F. II n /etait pas pretjn' e st - h, He wasn't readv, 

cc .p as? " f wasTTeT “ 

jjigg. =c * j Y, | (*•) *’• 11 w -itg ti 1 u-i ms lourj. 1 'eacxLier.,/ 

or. * I 

H { ^ K* He plo ldeci u> the stsi^s. 



o 
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A . 2 Deg rees of Dj / fci vn ce_ 

Differences between languages are m: only matters of type; they are 
also matters of degree. Differences in length, for example, vary according to 
the lumber of units by which one stretch of speech in one language exceeds its 
equivalent in the other language. 



These car be incorporated as indices of constituent elements - (aj_a2 fl 3 fl 4 £, i0 • 

Using the simple procedures of ordinary algebra, both positive and negative numbers 
can be include i, each side of the equation standing for a stretch of discourse 
in one of the languages: (A - A) and(A - A = 0) . 



two equivalent: stretches of speech can thus be measured. In measuring the 
conversion distance, it is essential to begin by est ablisli ing exactly what is 
being cor verted into what, since the operation in one direction may be marc com- 
plex chat it would be in the other. One might imagine that the point of departure, 
or source language (S) would be on the left of ti e equation, and the point of 
arrival, or target language (T) would be on the right. But such is not the case, 
since the difference must represent the number additional distinctions to be 
made by the speaker of the source language in orcer to equal those of the target 
language, hie formula for measuring the taxonomic conversion distance in dis- 



while increasing the range of possible sequences, one set of letters is suffi- 
cient beginning with the first letter of the alphabet. 



In problems of measurement, numbers as well as synbols are needed. 



Hither the conversion distance or life Jifferent i al distance between 



course must therefore be: D(S/T) 






Tlius, with a slight modification in the notation, degrees of differ- 



ence between two equivalent texts can be measured in each of seven dimensions. 
The texts mast, first be processed to determine wl icli segments are equivalent. 



Here i 



jin example of the procedure: 



1. lUt oi-nine direction of the conversion, that is, which is 

the target language (?) rend whi cji is the source language (S) . 
W in t e t ar go t text before ( lo f t or fib o ve ) s on r ce text £ r i gh t 
or below) . 

2. Segment texts for semantic equivalence: 

l\ On dit/ qu’/ 11/ cst ])arti/ . - 5 segments 
E. lie/ has/ alledgcdly/ left. - £ segments 
total segments 10 

3. Label segments in target language sequen r i ally (ABC..) from 
Jo ft to right. 

ABC 

Chi dit qu 1 / il / cst parti./ 

4. Give same label to corresponding segment in the source 
1 an gunge. 

He / lias / alledgcdly / left./ 
BC AC 

5 Establish basic equation: ABC UCAC 

6. Work out conversion equation as indicated below. 

7. Express results as powers of each dimension of difference; 

n n n n n n n 
HOLDERS. 



In this way, conversion distances can be separately calculated for dif- 
ferences in hierarchy, order, length, element equivalence, relations and set 
menbership. Or, expressed as a formula: 




b(S/r, 

4.2.1 



= ll(T-S) 0(T-Sj L(T-S) D(T-S) E(T-S) R(T-S) S(T-S) 
Oonveisiia ef iMorrmby: |1 (T~S) 



If any element in a sentence must change its hierarchical rank 



94 



9 



so ns to be made equivalent to an element in another language, the number of 
ranks or steps it has to travel up or dam the hierarchical ladder may he used 
as a no as u re of the conversion distance an hierarchy. The numerical values 
attributable to ea :h rank or rung of the ladder must have some relation to its 
degree of inclus i vencss . The highest number is therefore attributed to the 
highest rank , the text, or for convenience, the sentence (A,,) and the lowest 
lumber to the least inclusive, the allophonc or allophonic combination (A} ). 

If wo operate with nine ranks, the sentence ) appears as Rank 9, tlie 
clause (A:) as Rrud; 8, the phrase (A;) ns Rank 7, the group (A,) as Rank 6, the 
word (A,) as Rank 5, the morpheme (-A) as Rank 4, the pros ode ne (A) as Rank 3, 
t he phoneme (/.V) ns Rank 2, and the allophonc or speech sound (fAj) as Rank 1 
at the bottom of the ladder. The number of steps one must travel up ( + value) 
or down (-valuo) the ladder to convert a stretch of speech from one language to 
another ir. this way represents the distance in hierarchical conversion. 

Tor example, to make the intonation pattern of v : mytin c in the 
source language ($) , equivalent to a stretch of speech in the target language 
(T) , i .c. \i to utc heure , we calculate how many rungs higher is the target 
from tlie source, that is, T -S. Compare: 

Target Lg. (T) 

9 A 
8 A: 

Je nc suis pas libre a touto heure 7 A; 



O 
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6 A, 


A, 


5 A. 


A. 


1 i\ 


-A 


3 A 


A 


2 /A/ 


/A/ 


1 A 


A 



Source Lg. (S) 
A 9 
A: 8 
A; 7 



II (T - S) * A; - A = 7A - 3A « 4 
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By piefixing the corresponding rank values to each stretch of speech being 
measured, we c«u : T mply subtract one from the c«'her; as in the above exmiplc. 

4,2.? Co n vc rs i. on_ o f _0 r do v : A (T-S) 

] f the order of a stretch of speech in one language must be 
changed to make it conguont with its equivalent in another language, the 
extent of such a change may be used as a measure of the conversion distance to 
be travelled. The extent of the conversion can bo measured by the lumber of 
changes in position that have to be made, and the difference in the position 
that equivalents nay have in each sequence. These differences may be quanti- 
fied simply Ly calculating the number and range of position switches needed no make 
equivalents congruent. Number the constituent elerfcnts in each sequence after 
interfacing their equivalents. Tor example, \\ cl > s c v va t i on u nig uc / B. wni_que 
observation, would be B^A Difference must, of course, appear in 

posit \ \ i n u/i'ib 0 vs , s i n co ( » i de v , hi? i n g e xc 1 us i vo 1 y d 1. f fo iv n t i a J , does not :i n vo 1 vc 

questions of more or ] ss. If the measurement is in conversion rather than 

A 1 U 2 

differential di : lance, the results can be divided by two. So that, ^ g 

1 + 1*1 

2 

The difference may be that of a single item removed fi-om the position of the 
equivalent one, as the adjective in the above cxa.nplc comes before the noun in 
one language and after it in the other. Or the distance between the items may 
encompass most of the wo rdf in the sentence. Compare: 

(I. Rr hattc die Fes tun g allcin go gen clner Angriff vertcidigt. / 

)!. Me had defended the fortress alone against an attack. 

Segmenting one text according to the equivalents of the other 
and by subtracting the order nurber of the equivalents, gives the following: 
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1234 56 7 8 9 

A: lir halt-: die ft* stun" allcin go gen cj.ner An griff verteidigt. 

X: lie had the fortress alone against an attack defended. 

12 4 5 6 7 8 9 3 

0 + 0+1 + 1 + 1 + 1 + 1+1 + 6=12 

4,2.3 C on ver sa o n of Length : _L_ (T S)_ 

Anyone who has compared a translation with its original my 
have been struck by the fact that one of the versions takes more space than 
the other, or that a single word in the source language has been rendered by a 
number of words in the target language. Differences in length are measured in 
terms of immediate constituents, the length of words in syllables, phonemes or 
graphemes - A (abc) . - the length of word groups in words - A (a.b.c.), - and so 
on. This is not the case for differences in hierarchy, some of which also involve 
differences in length. The difference may be ro?asured for each of the equivalent 
segments by counting the immediate constituent units (phonemes or letters, 
syllables, morphemes, word groups , pa rase? , clauses or sentences) and subtract- 
ing one from the ether. 

r. simple way to calculate difference in length is to subtract 
the values of the final indices of etch equivalent segment, for example, to 
convert f P . — E.) French lo viol lard to English the old man , requires one extra word. 



L (E.-F.) = A (a],') 2 a 3 ), - A (32:12), = a 3 ’ a 2 = 3 - 2 = 1. 

Converting in the opposite direction: L (F. - E.) = A (a^) A ^^233) , 

= 2- 3 = - 1, or one word less. 

Differences in length may be contiguous ( ~ ) 

or non-cont i guous (_ ). It is therefore necessary to interface both 

texts for optimal equivalence in order to be able to measure with accuracy any 
difference in length ( Sec below ) . Compare, for exanple, the English (the p os t ) 
with the French equivalent (les post o s) and with the Japanese (posutu). 
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F* pOStCS rs 6 



J* posutti = 6 



E. post = 4 



E. pos t = 4_ 



2 



2 



l (f.- no-- 2 



L (J.- E.)* = 2 



Differences in order, already measured by a previous formula, 
will, in this way, not be confused with differences in length. 

In analysing two equivalent segments, however, there is always 
the possibility of confusion between length and hierarchy* Examine Die second 
word in the Hungarian sentence: M En hot c lb an vagyok, ncm rokonaiwnal. ,T (I'm 

in a hotel, not with Jiry relatives.) Even if another word like "szal j o d ab n:i 1f 
were used in this type of sentence, it would still be a single word* In the 
German equivalent, however, it could be two words (Di! Jjotjel) , and in r rcnch, 
three words (dnns^un hotel). Conparo: 



hotclban 


im 


Hotel 


daJis 


U]1 


hote 1 


in 


a 


hotel 



* 1 (E* - 11.) = 3 - 

= 2 (Is.. - GO - 3 - 

= 3 (E. « FO = 3 - 



1 » 2 
2 = 1 
3=0 



Therefore, one might imagine that: 



L (E. - II.) ; * 2 L (E* - G.) ; = 1 L {E. - F.) ; = 0 

There is something fundamentally different, however, between 
the structure of Hungarian on the one hand and that of German, French and English 
on the other. The difference of the equivalence is clearly of another dironsicn - 
that of hierarchy. For the Hungarian equivalent is expressed within the word- 
structure of the language. One can either be satisfied with this distinction and 
express the difference as: II (E* - if.) - 7A - SA » 2. Or the difference in 
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length can also be narked: (L (Il« - Ik) = A (a a a J, - A (a ) - a - a = 2. 

12 3 1 3 1 

Since indices of constituents renumbered sequentially 
^* i l a 2 a 3^ f° r eac ^ lajiguagc do not permit one to distinguish between the consti- 
tuents of one language and those of the other, the same number cannot now be used 
to represent different constituents . If the s are indices are to be used 

for both measurement of length and element equi valence, then the sequence of 
constituent elements of the source language rust follow those of the target 
language A ( a j ^ a ^) - A [a^a^) r Therefore, the sequences must be interfaced 
from left to right, with a value of one for o *ch constituent (See h*?»l > below ) , 

4.2.4 Conversion of Depth: P (T - S) 

Anyone who has attempted to translate a text from one language 
into another may have been embarassed by the number of ways the s arc thing c;n 
be expressed in the target language. Seme languages i a ay be richer than other in 
certain types of expressions; they may, as it were, go deeper into the subject. 

The difficulty of measuring differences in depth between two 
equivalent stretches of speech is that the alternatives arc seldom expressed. 

In many cases it is necessary to go beyond the textual evidence and measure 
alternatives from the code. The noree alternatives , the more powerful the language 
is in this ai'ca of "pression. Indeed, these alternatives can be conceived as 
powers to be added and subtracted: the total number of the powers in the source 

language can be subtracted from the powe v value of the corresponding segment 
in the target language. 'Hi is gives a measure of distance in depth. Examine 
the following: 

shine 

glow’ 

P. Je l f ai vu bri lie r / P. 1 saw it gleam 

glitter 
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shi mmor 
flare 

beau 
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Or, D (E. - I'.) = A . 7 - A . 1 = 

Depth measure .ncnt in ay bo applied net only to word differences 
but indeed to differences in any rank of tho hierarchy. 

Since conversion distance measures only one direction at a tine 
one must avoid concluding that tho fact of a target language being able to 
express the sane thing in a nuniber of ways means that the source language has 
only one way of expressing it. In order to find out the difference in the po- 
tential of each language, the number of possible ways of saying the seme tiling 
in one language would have to be subtracted from the number of ways of saying it 
in the ot h e r (Sc e_ 3^ 1_ an d 5.2 ), 

4.2.5 Conversion^ of lileinents : \ : . (T - S) 

People learning a foreign language are sometimes pleasantly 
surprised to find that a certain word is the same in form and r:rning as it is 
in tlicir mother tongue. A lcarnci* of French or inglish, for example, record ?.es 
such written words ns page , pip e and nat i on as familiar. 71ic presence of such 
words which are contextually identical in form and meaning nay make it easier 
to convert comprehension habits from one language to t lie other, since the con- 
version distance is thereby induced. Rut how cm this conversion distance be 
Treasured? The simplest way would be to subtract the number of identical con- 
stituents from the nunfocr of different ones, making sure to consider only the 
immediate constituents of the segments being compared. For example; 

P. \i son hote l / E. at h is hote l = 

A (aja^) , = A (a^a.) = 




a l a 2 a 3 



1 + 1+ 00 + 1 = 4 - E (F. - F ) *- 4 
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The value of an identical word in two equivalent sclents, 



however, depends on the length of sc gments ; one in three makes the text more 
similar than one in six. Since the segments may not be equal in length, the 
number of their identical interfacing elements can be expressed as a proportion 
of their combined total. Tor example: 

F. \ la page six / Ih on p age six 



A fa, n,. a „a .) = 

v 1 2 o <r 



A C a s a 3 a 4> 



a l a 2 a 3 a 4 



= 4 



a.,, a a 
^ 4 5 



1 +1+0+0 > 1 = .3 



B (F. - K.) = 3/7 * .43 



Since wc are calculating in temps of immediate constituents, 
t; e take into account only identities at the level of these constituents, e. g. 
a word is either the same or different.. In the last analysis, to take all 
similarities into account, we must deal in the constituents of constituents, e.g. 
the let ters, graphemes or phonemes of the word. For example: 

F. une longue pi on die / E. a l ong plan k 



O 
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> 

1 

> 

*0 

11 


h 


( a l a 2 a 3-* " A 2 


(a 4 ) 


= 4/4 


= 1 




Bi - B 2 = 




( b l b 2 b 3 b 4 b S*V 


B 2 


{bi b 2 b 3 b 4 ) 


= 2/10 = 


.20 


C i - c 2 = 


c i 


C c 1 c 2 c 3 c 4 c 5 c 6 c 7^ 


* C 2 


(ci c 2 c 3 c 4 ) 


= 4/12 r 


.33 


So that E (F. 


- E. 


) = F. CAjBjCj) 


- E. 


C A 2 B 2 C 2^ = 


1 + .20 + 
3 


. 33 
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4.2,6 Coinv rs 5 on of 1 o:\oj Ji (T S) 

Tv* o e q u i v a lent, s t re t ch e 5 o f s p e c d i r » ay be identical in h i. c ■ r - 

archy , order, length, and depth of their constituent elements aid yet differ 

widely in the number of obligatory relations between elements. These constraints 

may be no as u rod by counting their number in cadi segment and by subtracting the 

totals, _ 

3 c vieux cli e yaj_ the _ol d horse 

French. / English. 

la vicille maison the old house 

F, (2) - H.(0) = ? 

In other words, in each of i ho two equivalent 3 -word stretches 
of speech, there are two constraints more in French than there are in English. 

Or, R (F. - I;.) - 2 . With the sam: notation as above, relations nay 
be indicated thus^C > , each being applied to a constraint in the system 
[See 4,3 .12). Degree of difference is obtained by counting the lumber cf such 
relations on eacJi side of the equation and subtracting the difference. For 
example : 

F. ccs vicl lc s chauss urcs / F , these old sho es 
A c^a 3>* - A <C.a,j (a r 0 n^\-> = A*<2> - A<0 > ~ 1 

As with length, relations may be contiguous or non-conti guous ; they can there- 
fore be computed in the same fashion (See 4.2.3). 

To obtain more precise measurements, the relationships may be 
categorized; for example, nuirber (1), gender (2), ease (3), etc. , so that 
difference in type of relation can, if desired, be tab on i to r count. Consider 
the following: F. I Is ont leva la main . / F. They lif'r'd up tlici r ha nds . 

<*** - <*i - 1 

Since if la and H. t heir do not belong to the sar.o sub-set of determiners, this 
too would have to be taken into account (Se e 4 .2,7 below ). 

O 
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4.2.7 (frfl vo is ion of Sits: S (T~S) 

If Tin i toy, be longs to one set or l ate gory ( c . g , romnnl) in 
one language, its equivalent in another Jan gunge ray belong tc a different 
category. Verbs way scire l ivies be equivalent to nouns (e.g. C . rent a car. / 

F. Lo c a t. ion d f a v ■ i; os . ) and nouns to verbs (E. Aijeunrent: 5 p.n./ 

F. La so anc o s' est l ove a 17 h.). In fact, any item in any language can theor- 
etically belong to a different category or grammatical set when translated into 
an oilier language. 

In measuring this div.rnsien of difference, one nay cither take 
into account only the identity or lack of identity in set or tjv to measure the 
degree of difference. In the first ease, the constituents which do not belong 
to the same set as their equivalents may l>c marked by inverted (non-set) braces. 
For example: 

E . Cargl c twice a day, / F. Got mris mo deux fots par jour. 

a '-K ■ K 



In the second ease, the differences in set membership of 
equivalent elements maybe measured along any appropriate seal;; of difference 
between whatever sets each language po: es. For example: 

E. He was ski-ing. / F. II faisait du ski . 

Yt - W ■ 3 

In French and English, scales may be so refined ;is to include differences between 
sub-classes of determiners , or they c?li be limited to such gross distinctions as 
nouns (N) , adjectives (0), adverbs (A) and verbs (V) and other structure words 
(S) . Compare; 



O 
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English 




French 


N 1 


_r 


_ 1 N J T ai faim. 


I'm h un g iy D 2 


2 D 


A3 




3 A 


V A 




A V 


S 5 




S S 


H 


TOG 


H ■ • 
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better 

texts. 

I 

If 



mi 

DCS 

IfS 



o 




d . 2 . 8 Multi di mens! <>nal _Ton vi- vs ?. on R" st suee 

Values can uov be given to cadi cUrension , obtaining a 
picture of the extent and degree of difference between two equivalent 
To make clear how this can be dene, here are a few examples: 



F. manicure. / J. menikyua s/r*$ 
A = AA = L (1-2) 



~] 



F. Scs machoiros eta lent corbie s niuluo 5 . / 
];. llis jaws v.vre locked. 

A Cu 1 n 2 ) - A (a;) - L(?.«l), F (3) 

F. t_rot V i ne_r / I; . t^o_ddlc_ al ui < 

A(a ; ) - A (n 2 a : ,) - L(l-2) , H (3) 



r 1 b 3 



-1 3 
L H 



F. File ve r onne lupi*A ^vr'-nt . / 

\\, It has a sinister sound. 

= S(S-l), I! (3-6), li(l+2) 



H. - H 



F. Ill sc tort ura a computer son impot. / 

1: * He wracked his brains st niggling, 
strrf ning to figure out his taxes. 



M; • F); - 



D(l-2) , li (1+2) , S (5-3) 



-2 -1 3 
S H I: 



D - l E S S t2 



F. 11 sc suit it vide dc tout. / 

E. All he felt was an emptiness. 

A 4a,n,a J: - A )a,{ « L ( + 2), 1 (<1) , S(+l) = L + V S U 



b^'l ■ a H„ 



3 0 ?. 
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d:o h. 

r. 

A ( a ! ) . B(hjb 2 ),C(rj). A(a,). - Ala^) ,C(c 2 c : J ,B(b 3 b,,b s b 0 ) ; - 
L(2-2) + L(2-4) + L(l-2), 

e (12), o (4) - ir 3 i: 12 



1 III - 1 


:* last year's d n 




J 1 ui ].. 


:■ i ;i cux ](S rebir 


do 



LLiOS 1 : . 1:11c cria cm inquire coiv.ro u;io. perdue, / 

11- She shrieked bine murder. 



A ( n i ) - ” C 0'iLbz), AO^a,,); - A(fj,).B (W b 5 ) , - 

EC-1-1), H(9), o(J 2 ] : 2 )» S(5-2) « L +3 IiVs*- 5 

these taxonomic ne arum: men ts haw the advautag.: of indicating not only the. 
degree of lUffereico between languages as r.ani lost ed in their production of ills* 
course blit also the dimensions in which they differ. They indicate both the 
how and the how much. They have the disadvantages* however, of obliging one* to 
calculate as many measures as there are dimensions, each measurement being 
dependent on corr?ct and accurate cl ossi f L cat i on and interpretation of the 
categories i r the language being compare d. One must know something about how 
the codes have produced tho corresponding messages, what other rltcmativcs 
arc possible, whether relations are coripulsoiy or optional, which grammatical 
classes are being related, and many other such -points. In other words , the 
measurements measure more than meets the eye. 

A direct, objective and - for certain purposes - more useful 
typo of measurement could be obtained by limiting the differences to what is 
imrodintely observable. It would also be useful for certain purposes, instead 
of having scum i insurer, of difference between two equivalent 1 cm s , to con. 
out v:ith a single figure. 'Jhc.se objectives may be readied throiuji the 



EWC> 01 ' atlon of i'ltefiral mjRsurervnts. 



to 5 

ips. 



1 NTH URAL DISTANCES 1a lUSC.OUiUii 



5.0 JVocossing the Texts 

5.0. 1 Ort h o ■: raph i c To xt r> 

5.0. 2 Oil on ct i c To xt s 

5.0. 2.1 Sy 11 ab i c 5c j»u:cnl at ion 

5 . 0 . 2 . 2 Hi one ro S o 1 at i on 

5.1 Static Measurer out 

5.1.1 Supra- 1 ro.?odi c So pu: n ts 

5.1.2 In frn-P rosodt c Sc^;. -Tits’ 

5. 1.2. 1 Syllabic Uii is 

5. 1.2.2 Phone re Units 



5.2 Kinetic Measure rmit 

5.2.1 Supra- Prosodic Sequences 

5.2.2 inf ra- 1 * i os odi c Seq non cos 
5.2.5 Integration 
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s. i N'liiOiUi, djsva:;c!:S im 

1 i) coni radi s t :« net i on t o the t axom r. 1 c w : or arc :> of e a.ch d Lr.on$ i c: i of 
difference between two eqvi valent stretches of di scourge, is the : easurevent 
of formal differences between 1 we ’j mining texts. In this second type of 
isKtasmoK.-nt . a runn. ag text in cm-:. 1 an gunge is notched with a cove:* spending 
text in another language so as tr> show the extent of dl \ a rgen cc as a 
single measure. 

5 , 0 Pro cc s s :i r. p_ t h ? IT x t s 

Koforc nay syst or*r:t ic u:nsi?voncnt is rude, however, it is mcsfuJ to 
process the tv:o texts in such a way as to avoid errors in analysis and com- 
putation. The processing involves three main operations - interfacing, segment & l 
nin t ch i n g , an d ) nir.ro r leal rim rl< i n g . 

To begin will), L* ot h texts are interfaced, one trader the other, Ihe 
interfaced texts arc lb a studied to find out exactly what corresponds to what. 
And on this basis lue 1 1 >ts are segmented. Sc gixnt at i on by layers of immediate 
constituents, as is the common practice in uni lingual descriptive analysis, may 
not a 1 w a> s be th c h t y p c , s i n cc i n 1 on ge v or r,\> re c orp 1 c x s t rc-t chc s of 
discourse, it is lihely to prolong the analytical phase of the operation with- 
out contributing much to the end product. What is suggested hej'e is a type of 
segmentation 1/ minimal equivalence. This minimal equivalence represents the 
smallest segments in each text that can, irrespective of order, be considered 
as equivalent, 

bach scgi.i..-. 1 one text is then matched by connecting lines to the 
corresponding segment in the other text, and, if necessary, respaced. Mat died 
segments can be inaT o 1 and mirocrcd to permit easy identification, for example; 



Le pa' run 


a subi 
1 


un 

1 


jr auvni s 
1 


coup . / 
1 


'ihe hors 


had 


a s t rob c of 


bad 


lu:k, • 




2 


7> 


4 


5 
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Tr \ir. produces two i n t or faei n t ( ;. rc-ip-vuecs of sc pmon is of iJn.ij al semantic 

equivalence. The pi ocoss of soymmnt ai:i on tun i '- repeated until the smallest 

coi^yrtiMc. unit has been reached . The end-point will impend on the delicacy of 

analysis required. If the grapheme or letter is the ultimate unit, it is 

likely to sjioa* results somewhat different than would the use of the phoneme ns 
the smallest formal unit; but, if one is comparing the languages a: to how they 
look, it hi [lit be the more appropri ate form. In *Lh is case, one would be com- 
paring stretches of discourse in oj lho£; nphi c t ran script ion ; in the second esse, 
texts in prosodic, phonological or phonetic transcription would be required. 

Let us fiisl stud) the segmentation of ovtljo.pvnphi c texts and leave until later 
tlie study of texts in phonetic 1 1 enscri pt i on. 

5.0*1 Ortjio prnph i <: To x t s_ 

In treating the processing of texts, the operations are 
indicated on the loft and the Corresponding e xru.pl os on the right. 



°il£ l l£LL {rr 'l 



Lx pics 



1. Interface two equivalent texts 
A and X. 



A - I.o patron a subi un jnnuvais coup. 

X - I he boss had a stroke of bad luck.. 



2. Mark equivalent segments by over- 
lino ( a ) an 6 im do r 1 i no (x) indi- 
cating equivalence with vertical 
line 

a 



1 2 3 £ 5 _J>_ 

A: ke patroa i subi u n riauvais coup 

11/ 1 II 

X: ’1 he boss had a stroke of bajd luck^ 
~1 2 /T 4~“~ 5 ~6~ 



x 



3. Number segments for positions (P) 
left to right , opposite each posi- 
tion, indicating nunber of cl Clients 
(1;) (c.g. words) in the scgnvnt 



P(n) I 1 1 £ £ 6 

H(n) 112 111 

l-(x) 1113 11 



P(x) l 2 3 A 5 6 
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H 



St art i ng wi t!i i men series re- 


U(a) 


1 


1 


2 


1 


1 


1 


arrange each 1 r cc r fa ci ng sequence 
















according to lines of equivalence. 


15 (x) 


1 


l 


i 


3 




1 


subtracting the greater number 
















from the lesser, and summing the 


*J!(a-x) = 


0 


+0 


vl 


<2 




<0 = 3 


di ffcrcnccs 


















P(a) : 


1 


2 


3 


4 


5 


6 




P(x) : 


]_ 


2 


3 


4 


S 


6 




EP(a-x)» 


0 


+D 


+0 


+ 0 


*0 


-i0 = 0 


List results difference in the 


R(a-x) r 


3 












number of their elements I;(a-x) 
















and their positions in sequence 


I’(n-x) = 


0 













P (a-x) , 



6. Take each equivalent pair in turn 
for comparison of their formal 
constituents (i.c. the letters), 

7. Segment into interfacing graphemes , 
and join equivalents 



8. Total graph one S', (c) in each segment 
and subtract in real n infers the 
confined differences in length 



Le patron 
The boss 

l e patron 
lb e boss 



Ic(a-x) 



9. Add all graphenes 



10. Count matched p airs 



2 

- 3 
= 1 

2 

+ 3 
5 
2 



11. Add total graphemes in segments : Sofa* x) j 5 + 

+Xe(a + x ) 2 

12. Total identical (=) elements (p) £c(a=x) =2 + 

13. Express totals as a proportion 
or percentage. 



6 

£ 

2 

6 

__A 

10 

0 

10 

0 

2/15 



15 

2 

. 13 
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10 



2 



1 



1 



1 






Re - ; ! r r ;.!■ i go c a ch i i it c r fa ci n g s equal cc 
no c oyd i n g to linos o f ct; i: i va 3 on co , 
subt met lug the greater nv.-hcr from 
tho looser, fuel summing. the 
di f fe ronces , 



h (a) : 

R (x) * 

U(a-x) : 



} 

] 



I 0 0 

0 + 1+1-3 



ii) S on, ait i c Co r ponents 





English 




French 


1. Count number of diet ion my 






(pat ron 


X tu- an 1 Ji|' s of A c 1 o j;c*nt . 


boss 




(chef 

(conti email re 




M (a ox ) 




= 3 


2. Count number of diet ion my A 


F reach 




Fngli sh 


meanings of corresponding X 






~(b oss 


element. 


p at ron 




(master 
(head 
( ownc r 
(cnpl oyer 
(pattern 
(mode 1 




(x -O 7a) 




* 7 


3. Subtract equivalence 






,. (a~x) - 7 - 


4. Repeat calculations for other 


The/ be 




boss/p at ron 


words in text (c.g, the(30) - 








le (43) = 13) and express as 


le - 4 3 4 3 


+ 


7 7 


proportion of total meanings. 


The -30 + 30 




-3 +3 




n O~*)= 13/73 




4 /ID 


5. Interface graphic dis tnnccs(G) 


C = 3/5 


+ 


10/6 etc. = 13 


6. Reduce bolli lo percentages 


M = .19 


+ 


.04 * 




G = .6 


+ 


1.0 


7. Express as sequence of ratios 


. 19/. 6 


+ 


.04/1 + 



Such measures of interlingual distance could be experimental ly correlated with 
psycliolingui st i c measures of ini cvl i ugun 1 i dmM fi cal i on. 
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14. N o' t c d i f f e rc i \c c s I n 


so rrue a t a 1 order r , So g . 


order 


0/12 


length, v. oj'd length 


and word e lei cuts , Seg. 


length 


3/16 




V;or<l 


length 


3/36 




Word 


c ler.vh ts 


3/15 


15. Express in percent a 


ges or equivalent Sag. 


or do r 


.0 




Seg. 


length 


. 12 




Word 


1 on gi h 


.12 




Word 


c lc: ; jj i i 1 s 


. 13 


16. Total and average 


37/4 


.09 


Tills indicates the ever 


age distance in the formal 


character 


i sties of the two 



stretcher of discourse. Note that the noas'urem’nt is L» rsc i! entirely on the 
evidence supplied by the texts and does not refer to the corresponding codes. 

More information can be quantified c once mi ng the functional difference; 
between the texts , especially ns they affect the speaker, only by referring to 
the respective codes • Tins advant ago may b;> counter-bal anced, however, by the 
fact that the analysis nay bo cgi.c less objective and accurate, depending as it 
does c>n the complct mess of gramma t i cal and semantic knowledge on the operations 
of the code behind, the message. Here, nevertheless, are examples of such 
expansions designed to show differences in relationships and semantic components. 



1 . 



i) Relationships (i*) 

1 2 3 4 5 6 



Indicate syntactic dependence 
with horizontal line, thus: 




A: be patron a sub it un roauvais coup, 

u lj U _ 

n n 

X: Tlie boss had a stroke of bad luck. 
12 3 4 5 6 



2. Hunker the consul soiy categories 
of relationship (k),c.g. gender 
and number. 



R (n1 : i JL 

\_ 

R(x) 5 i 





nr> fT 1 
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Ill 



S . 0 . 2 



PIujhcm. b' Ye xts 

When the analysis 'reaches the word- level, there is the question 
of whether to use orthographic or phonetic texts. The above examples rad?- use 
of orthogrrphi c texts; let us now see what: t lie son:: texts would yield if 
analysed as tv/o stjoau; of speech. The texts cm i be first segmented into their 
constituent syllables tv as to analyse their prosotM c differences before they 
are c on s r < 1 e rc d as s c q v e n cos of pi i oi i c me s . 

S . 0 . 2 . 3 Sv H ;ib i. r S o y.i /or tot Hi, i 

A strange language may first strike cue by its 
peculiar rhythm, the pattern of aUcDTiat. ko:i between loud and quiet syllables, 
as it Hungarian, to a \iri 1 : ngunl speaker of English. Or the first tiling that 
strifes our ears i ?ny be the: rise and fail of the voice, as Chinese or Swedish 
would strike those* who do not know these 3 tinguugcs , I n each case, the first 
thing that attracts attention is not so much tin quality of the individual 
sounds, but rather the tone and loudness of the syllables. 

If the rhythm and intonation ore not too different 
to what the heni'er is used to, lie nay he able to spot some fund li ar words , 
for example, the word Canada in a Hungarian text may be twice as easy for an 
English speaker to spot as would the same word in a French text, although in 
most other respects French is more different from Hungarian than it is from 
English ( See above )• 

Languages may sound more different in their rhythm 
and intonation than in their constituent phonemes . He cause of this, homonyms 
may be all but unrecognizable when pronounced in another language, Compare , 
for example, F. devcloppersnt / E. de vc lop rent 

V. /“develop 1 n-r/ / E. /do Hole pent J 



Prosodic analysis includes features of rhythm and 
q ’ntonation; features of catenation ray be handled more convenient 1 y along with 
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no. 



the phenolic analysis. The unit of prosodic vr.zlys is is the m Us 3 no, 

tike best fit t ing intorfncj ng of r y .1 ] ab Id 3 , based, on the fore [;oi r ioxi.cn] 
matching pairs, in what respect, npa?t from its const it 1 e: ; t phem s . can one 
sy J 1 ab ] 0 di fie r f rov ; no i her? 

Syllables; con differ in loudness and Jou gth, on the one hand, 
and on hoi < h 1 and direction of voice on the other. Recurrent: changes of 
the first two features produce the audible differences in z’hythnj variations 
in the second pair reveal differences in intan a: ion. In order to be able to 
analyse and mas me those differences, a sir'] c notation, for each feature c-nd ft 
corresponding numerical scale can bo vrecl (fee 3.2.2 aVrve ), 

/pplyin^t tl , *• not : ' 1 c*: s 1 scale, by way of example, to the 
above pair of sentences, one can measure the o.v ent to vLSoh they differ 
prosod 5 , cal 1)' . 



Opera' i '-;is 



Hr 



1. Interface text equivalents and 
segment into syllables. 



(A) Le pat ron a sul i im mauvais coup, 
(k) The boas had a stroke of bad luck. 



2. Interface syllables for best 


(A) 


Le 


no tron 


a si! i 


1 

bi 


llil 1 j 


fit,e.g. stress opposite 








, ! 




j ! ! 


stress so that extra syllables 


(X) 


The 


pe, s 


)i a c 




a stroke ofi 


are the weakest. 






) ! 

\ 1 


1 




1 l 1 



r.au 



Vri^ 

i I 



bad j 



coup 1 
j 

luck \ 



3. Total blanks in bottom line. 
Total sums. This gives the 


,ii! I 

I* aaitron'a 

!' ! i I 


1 !. . 

SUM 


im 




! 


difference in syllables. 


’ihc j .boss had 

1 i 1 1 


j 




'stroke | 


|0fj 



l (a ~yj 



o 

ERfC 



raul vai s 



coin 
luc!' -5 



L (A - X) = 2 
L (X - A) = ^ 

I, (A~XJ - 6 



lib 



in. 



A . Mark >v j - i : > ; ii u 5 ; s \ 'Mi sb 1 c s :r'o r 
son If! 1 to inytbu - force (fj end 
1 c n f t h ( 1 ) , an ( 1 i n 1 01 a t j cm - L < • i pbt 
CIO > and di reel i c k C cl) of lone. 

(Sc_e_jrb k 6 . ) 

5, N arbor itcrus on line a 

according to table of values. 
(See t_nb Jo.) 



6. Repcoi ‘ for line x. 





1 




1 




1 


1 




, JL 


_ 1 I_ 


A 


YF 

i 


t r en 


1 ; 


t 


, U31 


inau 


jl-oup 






)- 


L_ 




-J 






, \ 


\ JLL 


X 


j’llie 


be 


»s s 


h ad 


|3 




bad 


luck 


f 


O) 


3 


1 


1 


1 


1 


2 






1 


(a) 


1 


1 


1 


1 


1 


2 






d 


(«) 


0 


0 


0 


0 


0 


0 






h 


(a) 


1 


3 


1 


1 


3 


0 






f 


Cx) 


0 


1 


0 


0 


1 


2 






1 


Cx) 


0 


1 


0 


0 


1 


2 






d 


(x) 


0 


0 


0 


0 


0 


1 






h 


(X) 


0 


2 


0 


0 


3 


3 







7. Interface lines 

1. f{a) - f(x) 

2. 3 (a) - l(x) 

3. d(a) - d(x) 

4. h ( n) - h (x) 



8. Calculate difference. 




Confute total. 



f (a) 111312 

f (x) 0 1 0 0 1 2 
f(a-x) 1 +0 +1 +3 +0 +0 = 3 



f(a-x) = 



1 (o) 3 1 1 1 1 2 

1 Cx) 0 10 0 12 

](n-x) 1 0 1 3 0 0 = 3 



l(a-x) = 3 



d (a) 000000 

= 1 

d (x) 0 0 _ 0 _0 0 1 

d(a-x) 0 +0 ^ 0 +0 + 0+1 = 1 

h (a) 13 13 3 0 

h (x) 0 2 0 0 3 3 

h(a-x) 13 110 3 = 7 

L (o~x) 



d(a-x) = 1 



h(a-x) = 

14 
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5.0.?,? i'll o;ic a, ! ::vicn t a e i jpn 

Hxl cncli n?: the analysis below the prosodic level, the 
phonetic transcript Ions of two equivalent texts con he interfaced to compute 
Hie constituent one! vectorial distances of their distin.-t i vc features. Con- 
tinuing, t lie iv fore, with the above example, tlio texts ir; the phonetic notation 

I 

raay be processed as foil o w s : 



Opernt i on s 



Ex a rril cs 



1. Transcribe equivalent sentences 
phonetically to include modlfi- 
c at i. on r. of eaten at i on . 



lo pat fo a si|bi go riovt ku 

bps hod o strouk ov b a: d 1 a k 



?. Interface word mid word -group 
equi vnlcnts . 



lo patf.S 



a si«M cc 



r !lOY£ tu 



bps hod 



o streak ov b:o d 1 a 1; 



3. In sequential erder, from left to 
right, interface phonemes for 
best fit within equivalent sea- 
men is - vow els wit h vewe 1 s , con- 
sonants with consonants. Mark 
(/) if there is no opposite. 



lo pat "to /a sijbi &////I // movt ku / 
'Do bps// ho lid c> strouk ov beef/ l\k 
2 i 1+2^ 1+5 + 2+ 1+1 =15 



4. Total (!) differences. 



b/a^-x/ = 15 (Sum of differences in 
number of phonemes per 
semantic equivalent) , 



5. Below each phene ns pair indicate 
intcr-phonene distance in vecto- 
rial values and add total. ( See 
Tab 1 os 7 am 1 S . ) 



1 o p a t a b ce m o v k u 

o o b p s o d a b d 1 a 

3r 1 +1 +3 +1 +2 +3 *. 2 +1 +2 +3 +5 +5 = 30 

F/a <^x/ = 30 (Sum of distances between 
matched phonemes.) 



These interfacing texts may be considered either as 
static segments or as kinetic sequences. For there are, ns v;c haw seen, two 
types of i; w.sureuxiit - static and U n?ri c , each applicable differently below the 



1.15 



in. 



p n>s o J i c 1 o c 1 v;h e to distinct! vc s o;n d f\ :• C u r w s l n t e r i r. *. o l he c :i 1 c u 1 i <. v s . 
After the texts have thus been p recessed. analytically, a few simple formulae 
can be applied f or each of these types of mvasu recent - sialic and Kinetic. Let 
us begin with static roasurcuiJil . 

5*1 St ;ihc. Measurement 

Idiot is hero required is a formula for roasurin •; the distance between 
two static i ntorfociiip texts os they appear to the eye. Since they will appear 
differently above the prosodic level than they .ill be lev, where the values of 
distinctive sound features appear, this dovo rkv. ion will hove to be taken into 
account. ‘I he? f i r: t tush is esc ab Jishiiu; an integrated formula for the reasuie- 
pCjit of distances in s up ra-p ros odi c scrnxnts. 

5*1.1 Svryrn-proso_di c Scfi onts 

One must begin, as above, by processing the text — interfacing 
and minh eilng equivalent segments (N) fios.i loft to light and indicating th oil- 
constituents (V). for example : 



1 

A. 11 


2 

ai np 


3 

be an coup 


la biere 


5 

do Muni eh . / 


N’t a) 


X He 


likes 


Mim i eh 


beer 


very much. 


N(n) 


1 


2 


5 


4 


5 





N T {a + x) ^ 10 

Here the segmental ion is or minimal equivalence. Although strati fient i c.ial 
and trails format ion segment at i on is also possible, these procedures ray sordines 
unnecessarily lengthen the analysis and computation. 

The next step is to calculate the miniver of transpositions (T) 
necessary to make both structures congruent. This is done by matching the 
equivalent positions and adding their differences in natural lumbers: 
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A. 



] 2 
il SI}];:? 



3 

bcaucoup 



00 Mu:U ch . / 



In ki'yj 



lie 1 ikes very much beer Muni ch , 

1 2 5 4 3 



A. 

X. 



1 2 
1 2 



T(A-X) - 0 + 0 + 2 + 


0 


+ 2 


- 4 




It is then possible to ; 


me a:; ure 


the diffe 


rente between the 


constituent 


verbal elements (V) by counting the 


number of 


equivalent inter- 


facing p:Jrs 


with common one-to-one cate* 


r.ories 


fc) and su 1 


itvact i ng then f‘vo:;i 


the tot al : 










A. 11 


ai me be aucoi ip ] a b i e a * 


do 


Muni eh , / 


V(R) «= 7 


X. He 


likes voxy much beer 




MujI i cli . 


VCx) = 6 










v(u4>o = n 


2 


2 2 




2 


C( u*.\) = l 



This difference in the make-up of the interfacing verb.nl con- 
stituents (VJ can be expressed by the formula: 

V(A-X) = V(a+x) - C(a+x). 

Here, V = V(7+6) - C{2 2 2 2) = 13 - 8 = 5 

Finally the difference between the units (U) comprising these 
constituents can be measured by subtracting the identical pairs (p) from the 
total nunber (n) of units, after interfacing the equivalents for maximal 
correspondence. This can be performed on texts in either orthographic or 
phoneratic transcription. For example: 



1 15 . 



0 
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b i ere = & 

) f n ~ 9 

h _e c r * - A 

2 + 2 =: ]) - >1 

U -- n - p = 9 - 4 “ 9 

/b ;if;tl/ = -1 

n = 8 

/hi 0 h/ = 4 
2 ]) = 2 
U - n - p ' 8 - 2 « G 

In this vay di st ar.ee s of homographs an 
distance in the full orthographic text: 



M u n 1 c It =■• 6 

i * « \ i i n - 12 

M a n 5 c h ~ 6 

2 + 2+2+/+2-r2 ■- ]> ~ 12 

li - n •• p - 3 2 * 3 2—0 

/n y n x )./ " 9 

1 < 3 n = 1 1 

/m ju n 1 k/ = 6 

2 + 2-12 = p s 6 

U - n ♦ ]> - ]1 -6 « S 

homophones can be di st i ngui sited. (The 
eo: es out to M - 32 - 32.) 



There are nc;, 1 three measures (U, V, and V) of kho strnciuro? 
and constituents of the i ntc r feeing texts. These m.-naures cannot be added 

because they ivpvesent essentially different things. Re fore they can be inte- 
grated they mist fi rs t be educed to the same scale. 

This can be* achieved by placing the minimum and the ina xi 
differences in each case between 0 and ]. If the minimum in cadi case is food 
at 0, the maximum for U, V and T must then be determined. In the ease of U, if 

all interfacing graphic forms are identical, then max U = n and rax (U) ~ 1* 

n 

In the case of V, if all equivalent constituents are compatible 

in category and level, then max (I Q = 1 * Or, max (n - p) = 1, 

v n 



an d nnx ( v - c) = 1 . 



v 



In the case of i , if equivalent segments occupy the same 
position then, nin(T) = 0. Put hew is the maximum determined? If all inter- 
facing segments are numbered as above, according to their position in the 
sequence, axd their position nurbers subtracted in positive lumbers, a irr.srvo 
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o\ Lhc stvu.Lv.n-: difference n the vex ts ]s obtained. Lh;:i is the? 

Jir.iV nr mad ) ax to which such differences can extend? Tins jnt xi muhi is obviously 
rebelled when the paired equivalents are in to riV.ee d in reverse order, lor 



exm v 1 o : 



, 1 2 

V . c e;;:/ c n co ad\ i n 1 s t ra t i vo / 

I: , adnn nisi rot i vo r . n yet cnee 

2 ^ 1 

1 2 3 

G* llier bin ich./ 

I: . ] tun here . 

3 2 1 



Let us examine these relations end extend them so ;-s to 
c xt ract l ho p at 1 o in . 



12 = 2 posit ions : 



2_ 1_ 






2 x 2_ 


- 2 2 


1 + 1 


- 




2 2 


2 


1 2 


3 4 


= 


4 positions : 




4 3 


2 1 




4x4 


11 

■ 

(V) 


3 + 1 


+ 1 + 3 


= 


8 2 


2 


1 2 


3 4 


5 


6 = 6 positions : 




6 S 


4 3 


2 


1 6x6 


= 6 2 


5 + 3 


+ 1 + 1 


+ 3 + 


5 = 16 2 


2 








N positions : 










N X N 


= N 2 








2 


2 


Koto, 


however. 


that 


in the above, N is always 


c ven . 
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Suppose thni 


: the total position:: 


c o i : ; s 0 ; ■ i 1 0 a: t c d . 


■ * 1 1 fa, j , f , , 


Examine* the 


foil ow.3 ng : 






1 




1 position: 








(1x1)-] --- l 2 - 


- 1 








0 


« 0 




2 2 


1 


2 3 


3 positions 


* 9 


- 5 


Jl 3_ 




ll x 3 ) -1 - r-_ 



2 + 0 h 2 - 4 2 2 



1 2 3 4 5 

5 positions: 

-5 4__ 3 2 1_ (5 x 5) - 1 * r, 2 -J 

4+2+0+2+4-12 2 2 



N - J pc? it ions 



(N XJO_ 

? 



- 1 



The formula for the maximum nunocr of structural transpositions, therefore, 

wi l± depend on whether the total number of segments is odd or even. So that if 

N is even, maxT(N) = 2 ; if N is odd, maxT(N-l) = N" - 1 

2 

Demonstration; 

NL 2 

If maxT(N) = 2 and max T(N-l) = N - 1 , this should be valid ior 

2 

N, N + 1 , N + 2 , etc. Let us demonstrate the truth of this for N + 2. 

n£ + 2 (N + 2 - 1) = N 2 + 4N + 4 = (N + 2 ) 2 

2 2 2 

N 2 - 1 + 2 (N + 2 - ]) = N 2 - 1 + < 1 N + 4 = N + 1) (N + 3) 

2 '2 '2 ' 

So that the ratio between the square of the total nur.be r of scgrculs in two 

^ interfacing structures and twice the limber of t renspo? it ions needed to eibc 

ERfC i 
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1 



V.'r-r) congru-jU ranges bof.wecn 0 and ]. Therefore : 



maxT(N) 



K~ 



and 



;? 

if maxT(N) = 



maxT(N- 1) 



N 2 -J ami 2 waxT(K-l) =■■ 1 

"*2" " ? - i 



o 3 

N 2 



n.ax(V) = 1 

V 

IHvT X (11} ■' 1 

n 



The supra-prosodic distance (I)} , therefore, between two inter- 
facing structures of Language A language X, is equal to twice the number of 
equalizing t rajisposi t ions (2T) over the square of their combined equivalent 

o 

segments (N") pits the difference between the constituent elements (V} , plus 
tiiv total graphic components minus the number of identical pairs, each set 
ranging in value between 0 and 1. 



D(s 



4 

ov 



2T 



V 

v 



U 

n 



These integrated measures must therefore range between 0 and 3: 




u = n - P 

But n n . Therefore the integrated formula for static 

diffcjences in texts analysed above the prosodic level must her 




S . 1.2 Inf ra - p r os o die Se gr outs 

If interlingual distance' differences below the prosodic level 
arc to be included, we nust measure* features that are acoustic rather than 
visual and state them in away that reflects their degree of similarities. 

O 
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The acoustic features to do v. 1th the urosodi c varishh s of the sy ] j ab J e ; 

the allophoiuc features, with differences in the proaunci atioii of phene :,vs in 
connected speech. This means that in the integrated formula, one term [the 
graph ic/j'.h cue rati c) will lone to le replaced by two (the sy 1 1 chi c and the 
phonetic) . 



5. 1.2. 1 Syndic Units 

In adding a masuro of prosodic distance to the ftnmuia, ci 
must apply procedures of syllabic measurement before working out tiK Aumula 
for its integration. 



Procedures of prosodic measurement, as already explained, 
include the interfacing of the syllables of equivalent speech segments for 
optimal correspondence and the marking of cod; syllable for ihylhn (force and 
length) and intonation (height and direction). linrh of these four sound com- 
ponents is measured on a scale of iv Native values for force (f), length (1 ), 



direction (d) and height (h ) (See_;1.£. 0 above ) . 



For example, take the homogriph Muni_cl^ as pronounced in 
comparable contexts in* F/iglish r d in French. First devide this strctcli of 
speedi into syllables and mark each for relative length, force, height and 
direction. 



V7T 




mju 


ink 


l J 


LJ 1 



7 


1 


l 

ry 


»_ 1 

J nik ] 



h. 


3 


2 


f 


1 


0 


d 


Jv 


2 


1 


0 


0 














h 


3 


1 


f 


jJ 


M 


d 


0 


1 


1 


°! 


1 
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Wive the proper mJii'Ji ■ c:il val u:s t> the mr-nked hi fforoji 



End treat each as a prosodic vector - subtract \ ng one fron the other in 
p os 1 1 i vo i n to ge lr, . For c x m:;p 1c : 




f 3 d h 



2 2 13 



110 3 



1 :1 +1 h 0 



L£. nil-: 



0 0 0 1 



■- 1 - 
.hi 

1\ nU: 



2 10 2 



r - 2 < 1 +0 +1 = -1 

7 

Then add the vectorial di ffoTcuc.es for cadi pair of syllables 
(3 + 4 ) , to ob i ain the p r os odi c di s t mice ( 7) . 

In brief, the prosodic distance between two equivalent, 
stretches of speech is calculated by taking the sum ( £) of the difference 
between cadi juxtaposed syllable (S) at each position (q) until all syllabic 
positions (Q) have been covered. That is: 



mul a. lliis can be done by expressing the syllabic distance as a percentage of the 
limit (h) of the rexiium (max), which is 12. 

O 





In the above example, Q = 2 syllabic positions, = 3 and - A. 

Q Q 

r 7 = 3 + 4 7 




The next problem is to integrate this into the genera 1 for- 
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Table 11 





SI (AS A\h 


i.Yl ■’ '.OU 


1011 INTIXUM 


- Mh\Sl^h 


* 


Q = 


total of juxtaposed 


syllable 


s k ^ 


total of iu 


;:i a posed ;A 0 : 1 c ires 




( i m: 1 , e vy t y appc> s i t i on ) 




(in cl . er.pt; 


y opposition) 


q = 


a given sylJnblic po 


si ti on 


k * 


a given phe 


;, n: posit ic.i 


S - 

q 


s y 1 1 a’ > i c. d 3 f f e re n c.e 


tit position q Pj,~ 


phO'iCHt' (7 if 


ft ur.ee a t p 0 s i t i 0:1 k 

1 


L = 


up p o r d 1 f 1C: re n " i a 1 p .i v >s o d i c 


limit B = 


upper dif. 


■ ■ n t i a 1 ph » n e rto hrundniy 


1 a 


lower di fie rent i cl prosodic 


limit b : - 


lew 1 or diffc 


lential phener*. v ouad-:. : ‘ 

1 


U « 


units difference 




u ~ 


mu her of u 


nits 


V * 


con slit ue n 1 di f fe i an ce 


v = 


mu her o f * 


cr.s ti tuents 


t' = 


common category 




C ” 


nur-ho of ^ 


:v-on constituents j 


N = 


number of segments 




n = 


r v hc i of o 


loronts | 


T - 


trails p os i tier* 




P K 


mu her of i: 


r.tehed pair’s j 

i 






f 


\ ^ 








\ »1 3 X 


v c 


0 = 


fv _ 
,r V> - 

Kc i 


BO 






i xc ..»<•- 

M : 5 


n / ./ \ 

J ' ’i 1 


l A Vo>i 




Th at is, 1 


the maximal possible 


dif fore nee 


, .1 the ] i rc:‘ 0 d i c fo at ti : 1 s 


of 


any two s tret dies of 


speech 


(I/Q) is equal 


to tlie tot 


1 n ur.fr cr of syllable. 



($) tines the di ffcrcntia 3 limit (10, or maximum difference between two 



syllables. This Maximum must be nado equal to 1 if the i ^odic formula is 

■ -A" s l - * 

Tq vf=iA ) 



to be integrated. So that, 



TJiis will have to be added to the infra-prosodi c kinct.iv r .^n ula as a new term 
expressing the prosodic differences in the pronunciation of .y two stretdies 
of speech as being equal to the sun of the di ffe ivnct .> between all in' rfaced 
syllables f over the differential prosodic lirdt (L) tin's the total 



O 
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nurber of sylln L-.I cj (Q) , that i 




Since psoition in a sentence is sometimes a tenet Lon of intonation and rhythm, 
it nay lu: useful to repeat the calculations by juxtaposing entire sentences 
as s eq uc n c c s of r.y 1 3 ah le s . 



syllables can be further i-ensurcd. It is at this level that the effects oi 
cat en.’fL i on , often anal)’ sc cl under the prosodic constituents of speech, can be 
taken into account, 

for example, the french/Hn f.l isli homograph cfliservat i on 
could be transcribed, segmented and interfaced thus: 



transcription of the two interfacing texts. Under each interfacing pair, the 
vectorial differences are indicated; in the examples, these will be taken from 
the table of phoncro distances ( See T a ble 8) . The next step is to take the 
sum of these distances as cxpixisscd in the formula: 



where K is the total nuiibcr of positions; k, any given position, and P^ , the 
vectorial distance between any two phone r-es interfaced at position k. Using 
a previous example for purposes of comparison: 

O 



5 . 1. 3. 2 Phoneme Uni ts 



Distance between the phonetic constituents of the 




/ 



Preliminary to any such measurement is an adequate 
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a = j; 



/ i: 



lw, ] 



1; 



F. l> j 



H. b T & r .l 



0 + 2 + 2 +4 



8 



> 



P + P + P + P 




To integrate this measure into lie general foirruln, n i i possible 



results must be Titled between zero and one. Tins neons th i t'o cnlcul aliens 
vvnst be made within boundaries, a lower boundary (b) ami an upper boui.dej/ (B) . 

If two phononjs are phonetically identical, like the Preach and the English 
/n/> the voctorirl distance between their pronunciation is zero, which is the 
least , or lower boundary (b) , and b " 0. The two phonemes of the interfaced 
phonetic codes winch difJer the r.ost will have reached the upper boundary of 
possible difference between a phoncr.o in one language raid a phoneme in the other 
language. In the case of Trench and English, it would, according to our table, 
be 17.5; but since in actual stretches of speech there can be context conditioned 
length differences, we must add another point to take such possible diffo'ences 
into account. So that T> /J : .~l;./ = 18.5. 



of any two equivalent stretches of speech is equal to its length j.n the total 
number of positions (K) tires the maximum difference between any tw»o phoneros 
(maxP^) , each taken from one of the languages being compared. So that, 



f lhe maximum difference possible (BK) in the phonetic features 



BK 



JL^ 

> . paxP 



K * 1 



In order to fit into our formula, this maxi run must be mace 
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equivalent, to cue. Mo I'hnt I»K 



' j 



The phonetic di ffc re n cc between the yronunciat ion of any 

two stretches of speech is eqtvJ to the tot cl iwrboT of inter feci nr 

/ \ 

phonemes ( b ) (including nul opposites ) t lives their interfaced, dif- 



ferences (['•} over their maxim;*] vectorial distances (B) , fim-s the mr.bcr of 
positions (K) . That is, j * Ar'lvlnr this to the above 

i. • i K ‘ 

bk i:p 

example, whore K-=4, P, - 8, B = 38.S, we yet r. ji. Jt - S - 8 = .ICS 

13 K >■•■■'••' : > r 

’]hc general formula extende d below the phono: e level would 
the re f o re require four t c r ms : 

2 T \ 



1 . 0 < f ™ 



V N 



,2 



N“ - 1 ■ 



2. 0 < 

\ u 



3 . 



■ < fiTAV. 



d. 




Since cadi of the above taupe? between nero and one, the 
integrated formula must represent a ran pa between aero and four. Here then 
is the integrated formula for static distance, extending below the phertero 
level between any two equivalent s tret dies of speech: 

~ ' v! 4 - -<r + -taa gu: 

* N u L Q. 1/) N 

I :R?C *( h- ) i ?> 

L..» J \ pi ■ "w 11 r 1 
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5 . 2 K i : i c 1 3 o M: ■ ,* . r > i 1 re t *:-i si 

In the integral r.reesmvs of discourse distance, the interfacing 
texts have been con:- i dared ns if they were two linos in space, segmented so 
that the equivalents cm be matched and jammed. In this static type of 
mosuvu.vant , the live dermr. of corrauni ent i on v;ns not taken into con si der- 
ail on. 

If the tin.;? element, or more exactly, the order of appearance of 
the constituent units, is trf.cn into account, the sonants take on the 
character of sequences. In finding the difference between tv;o Matching 
sequences of discourse, the interfacing constituents can be measured ns being 
either the same or different, or ns differing >n degree. 

In the first case, being above the prosodic (sijpra-prosodic) level, 
differences nix; discovered through the graphic i op resent at ions of the texts - 
either orthographic or phono mat 3 c, As vc have seen, this graphic representa- 
tion of speech may be limited to the sequence of phone nos ; or it may be n 
very narrow transcription of the speech sounds* 

5.2.1 Suprn - Pr os od i c Segue n ces 

The interfacing texts appear with elements arranged in a 

certain order; and since spatial order con be treated as analogous to temporal 

order, the same transposition formula already established for structural 

2T 

di s t an ce can be use d , th at is, —c 

w 

What must now be consi do red is not only what equivalent elements 
arc identical, but where - or rather, when. That is, at what point in the 
time sequence does the identity appear? In other words, not only the number 
of positions but also their sequential order must be taken into accovmt. 

Suppose, for example, that one were to measure the kinetic 



O 
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distance bcu.’.A'ii the French word foir n'Les and the English (N. American) 
equivalent forL.aKq in phonemotic (broadest) transcription: 

1\ f o re m y 1 (a) 

r. f 3 - lil ju 1 3 Z (x) 

(a < >0- 

Examining tlie above interfaced transcriptions, we note the follorjrq 

' 1. Tlie French form takes op six posit ions , whereas the 
English equivalent takes up nine. 



2 . Of tlie Maximum of nine positions, six have interfacing 
equivalents; and three /j,°,s/have non o, lot us label these 
tlie group K, so that K - 

3. Of tlie positions filled with a pair of interfacing 

phoneme*, , some are the same and some are different. lot 
us label this latter group L, so that L = | , 

4. Four of the phonemes as hero represented are identical in 
both stretches of speech. This is the group ^f,o,m,]j , and 
it is obviously equal to zero, |f,o,rn,l^ 
differences need be measured. 



0 . Now the 



o 

ERIC 



5. Ke note that R/^ occupy Position 3, and y/u occupy 

Position 6. So that 1 or L = ^ , 1^ ^ 

6. Each of tlicsc positions of the different interfacing 

units (a + x) , i »e. (6 + 9) can be expressed as an average 

a + x - 6 + 9 = IS. 

2 ~ 2 2 



2 



129 



127. 



7, h'c cm express li:o value of the common denominator 
by a factor cqnival'-nL to the mu her of interfacing 
pairs (2) in the sequence, i*e. f 1 1 ) by considering them 

ns exponents the values of which are reduced to 2 powers, 
e.g. (3-2, 6-2), giving us, 2^"'* . . . . or no re generally, 

2*'“^. Using this variable denominator for each uni den t i cn] 
pair thus gives values will ch decrease according to the dis- 
tance of the unit fro; -a the first position, in this case, from 
the beginning of the word. So that R/4 ~ 15/2"“ , and y/u - 

15/2^“^, or generally a j- x . 



8. booking at the first (0,x) sot of words without opposites, 
j,e,z| , or set K, occupying positions , we need 

only vary the formula by reducing the power of t lie exponent by 



one, so that 2 



k-1 



becomes the denominator. Therefore, 



= 2 5 ' 1 , = 2 8 ' 1 , ^0/z | - 2 9 ~ l , or generally £ +_3 



9. Tlie general formula for the kinetic distance between two 
equivalent interfacing sequences O'O is therefore; 

a + x + 




£ 



k c K 



.k-1 



JLJL2L 
re l 2 1-2 



So that 0 + 9 + 6+9 + 6+9 + 6 + 9 A 6 + 9 
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V u get. rot i d?a of ho: the distances obtained through such ro;vsur.?- 



mcnt may differ from those obtained through st at 3 c roersuromer.ls and also 
between stretches of speech in orthographic and phonetic t re n script 5 cv. , let 
us apply this formula to the above examples of F. hi ere and E. beer: 





a + x 





dli 



a +_x 

*" „ o 
2 * 



5 + 4 + 5_+ 4 

’ 2 Z-2 2 T-2‘ 



k c K 



a_ + 
A 

/T 



X 

T 



/ ; 

.(fb 



+ X 

A-J 



= .SC + 9.00 + 

V; s 10.01 

- \ beer / 



.45 = 



9 + 9 + 9 - 9 + 9 *9 = 

7 ”7 T r T * “ T 
10.01 



The same formula applied to a phonetic (narrow) transcription of these two 
words would show a greater difference, corresponding to the different pro- 
nunciations of the (hr) . It would show: (bj£’.»/ b J ^ .1 ) 

Li 12 h3 M 

a = b j [; tl =4 



x = b I 



W- 



T ti. .yr_ 

ir K z*' 1 fd , 2: ‘ 



a + x = S 



'• ->V 'y ’4 



0 + 4 + 4 4 4*4 + 4 + 4 - 



, 2-2 



.3-2 



,4-2 




d + G + _8 = ^ + 1 + ^ 

2 ^ 2 ^ 2 ^ ^ ^ 4 



8+ 4 -+ 2 = 14. 
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To find the kinetic cT ffe tenccs between t wo sequences of words, 0 : 1 c 1 
in us t s i lip 1 y a del the di f f c j on c 0 s 1 » c i - 1 . ? ecu e a eh i n t e r f a e i r . g p air. 11; e d i. f To re* nee 



for a t c -\t of (]) pairs is the su:-li of the individual di f icronce.*' . Thai is, 
it x 



y; 

X r J 



>; >: *-**■ + >: >: yen 

X r ■/ .jf/tK ^ -<■ = i -tC. L tv. 



5,2.2 Infra-u rosodi c Sentiences 



liven though the t rouscripti 0:1 of the interfacing stretches 
of speech should be most narrow, the amount of detail needed to differentiate 
tlie sounds may not be sufficient. This is especially likely in the measure- 
ment of differences between closely related di elects and also in the study 
of bilingual interference. He low the graphic level of differentiation 
sounds can be broken down into distinctive fc;r. mvs, each, of which is scaled, 



as has been done for the in f?a -prosodic static measurements ( See T abl es 0 , 

1 and ft ) . 

The difference here, however, is that the actual vectorial 
value of t: lie distance between two interfacing sounds depends on its position 
in the sequence, which has been established as a minimal equivalence. As 
in the kinetic measurement of interfacing segments of graphic syrbols 
(orthographic or phonetic) the values given to the differences are dec re men tal - 
they get smaller as the sequence advances in tiro. Here ,h owe ver, the differ- 
ences must be recorded, not as same- or- di ffe rent symbol, but as a degree of 
similarity or difference. This is obtained by subtracting distinctive features. 

The treatment of these two types of measurement must there- 
fore be somewhat different. Symbols that are the sane, different, or without 
opposites need no longer be taken into account, since those representing 
sounds with identical features will have a distance of zero, those that arc 
different will shew a positive vectorial value, and those without opposites 
r n I r ay be given either the value resulting from the sum of their distinctive 
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